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(Dated: February 7, 2008) 

We have measured the azimuthal angular correlation of bb production, using 86.5 pb~^ of data 
collected by ColHder Detector at Fermilab (CDF) in collisions at -^=1-8 TeV during 1994—1995. 

In high-energy pp collisions, such as at the Tevatron, bh production can be schematically categorized 
into three mechanisms. The leading-order (LO) process is "flavor creation," where both b and b 
quarks substantially participate in the hard scattering and result in a distinct back-to-back signal in 
final state. The "flavor excitation" and the "gluon splitting" processes^ which appear at next-leading- 
order (NLO), are known to make a comparable contribution to total bb cross section, while providing 
very different opening angle distributions from the LO process. An azimuthal opening angle between 
bottom and anti-bottom, A0, has been used for the correlation measurement to probe the interaction 
creating bb pairs. The distribution has been obtained from two different methods. One method 
measures the Acji between bottom hadrons using events with two reconstructed secondary vertex 
tags. The other method uses bb {J/'ipX){£X') events, where the charged lepton {£) is an electron 
(e) or a muon (/x), to measure A(f> between bottom quarks. The bb purity is determined as a function 
of A(p by fitting the decay length of the J/^p and the impact parameter of the i. Both methods 
quantify the contribution from higher-order production mechanisms by the fraction of the bb pairs 
produced in the same azimuthal hemisphere, /toward- The measured /toward values are consistent 
with both parton shower Monte Carlo and NLO QCD predictions. 



PACS numbers: 13.85.-t,12.38.Qk,14.65.Fy 
Keywords: QCD, heavy quark production 



I. INTRODUCTION 

The dominant b quark production mechanism at the 
Tevatron is believed to be pair production through the 
strong interaction. However, predictions from next-to- 
leading order (NLO) perturbative QCD [1] have so far 
failed to describe the observed b single quark production 
cross section [2-8]. Differential cross section measure- 
ments have also been systematically higher than theo- 
retical predictions [9-12]. Possible explanations for the 
disagreement between the measured and predicted cross 
sections involve improved b fragmentation models [13], 
and non-perturbative bb production mechanisms [14] , and 
supcrsymmetric production mechanisms [15]. 

Studying bb correlations gives additional insight into 



the effective contributions from higher-order QCD pro- 
cesses to b quark production at the Tevatron. For ex- 
ample, the lowest-order QCD bb production diagrams 
contain only the b and b quarks in the final state. Mo- 
mentum conservation requires that these quarks be pro- 
duced back-to-back in azimuthal opening angle, A^, and 
with balanced momentum transverse to the beam direc- 
tion, pt- However, when higher-order QCD processes 
are considered, the presence of additional light quarks 
and gluons in the final state allows the Acf) distribution 
to become more spread out and the b transverse mo- 
menta to become more asymmetric. Previous measure- 
ments of azimuthal correlation distributions have yielded 
varying levels of agreement with NLO predictions [9- 
12, 16]. Additional measurements related to bb produc- 
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tion arc needed to determine whether experimental mea- 
surements are consistent with the Standard Model pic- 
ture of bb production. 

The NLO QCD calculation of bb production includes 
diagrams from each production mechanism up to 0{a^). 
The NLO calculation is the lowest order approach that re- 
turns sensible results because certain classes of diagrams 
which first appear at 0{a'g) often referred to as flavor ex- 
citation and gluon splitting diagrams (see below)-provide 
contributions of approximately the same magnitude as 
the lowest-order diagrams, which are 0{a'g). This contri- 
bution can be midcrstood by considering the cross section 
for gg gg which is approximately two orders of mag- 
nitude larger than the cross section for gg — > bb. Higher- 
order bb diagrams can be formed from the leading-order 
diagram gg gg by adding a g ^ bb vertex to either in 
the initial or final state, but even with the 0{as) sup- 
pression, these higher-order diagrams still provide contri- 
butions that are numerically comparable to the leading- 
order terms [1, 9]. Therefore, higher-order corrections to 
bb production cannot be ignored, and a recent measure- 
ment indicates that the higher order diagrams contribute 
a factor of four above the leading order term [16]. 
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FIG. 1: Example Feynman diagrams that contribute bottom 
production. The bottom two virtual exchange diagrams enter 
into the NLO calculation through interferences with leading- 
order terms. Interferences between the flavor creation, flavor 
excitation, and gluon splitting diagrams, as well as the virtual 
exchange diagrams, are ignored in the parton shower approx- 
imation 

An alternative approach to estimating the effects of 

higher-order corrections is the parton shower model im- 
plemented by the Pythia [17, 18] and Herwig [19] 



Monte Carlo programs'^. The parton shower approach 
is not exact to any order in as but rather tries to ap- 
proximate corrections to all orders by using leading-order 
matrix elements for the hard two-to-two QCD scatter and 
adding addition initial- and final-state radiation using a 
probabilistic approach. In this approximation, the dia- 
grams for bb production can be divided into three cate- 
gories: 

Flavor Creation refers to the lowest-order, two-to-two 
QCD bb production diagrams. This process in- 
cludes bb production through qq annihilation and 
gluon fusion, plus higher-order corrections to these 
processes. Because this production is dominated 
by two-body final states, it tends to yield bb pairs 
that are back- to-back in Acf) and balanced in pt- 

Flavor Excitation refers to diagrams in which a bb pair 
from the quark sea of the proton or antiproton is ex- 
cited into the final state because one of the quarks 
from the bb pair undergoes a hard QCD interaction 
with a parton from the other beam particle. Be- 
cause only one of the quarks in the bb pair under- 
goes the hard scatter, this production mechanism 
tends to produce b quarks with asymmetric pT. Of- 
ten, one of the b quarks will be produced with high 
rapidity and not be detected in the central region 
of the detector. 

Gluon Splitting refers to diagrams where the bb pair 
arises from a g — > 66 splitting in the initial or final 
state. Neither of the quarks from the bb pair partic- 
ipate in the hard QCD scatter. Depending on the 
experimental range of b quark px sensitivity, gluon 
splitting production can yield a 66 distribution with 
a peak at small A^. 

Figure 1 illustrates some lowest-order examples of each 
type of diagram. The general trend is that fiavor creation 
diagrams, being dominated by two-body 66 final states, 
tend to produce back-to-back 66 pairs balanced in px, 
while fiavor excitation and gluon splitting, which neces- 
sarily involve multiparticle final states including a 66 pair 
and light quarks or gluons, produce 66 pairs that are more 
smeared out in A(/) and pr- Categorizing 66 diagrams in 
this scheme becomes ambiguous at higher order in per- 
turbation theory. In the parton shower approximation, 
flavor creation, flavor excitation, and gluon splitting pro- 
cesses can be separated exactly based on how many 6 
quarks participate in the hard two-to-two scatter. Inter- 
ference terms among the three production mechanisms, 



^ Herwig and Pythia use the exact matrix elements for all parton- 
parton two-to-two scatterings. However, all two-to-A^ (A'^ > 2) 
processes arc estimated using tlic "leading-log" approximation, 
which becomes exact in the limit of "soft" or "coUinear" emis- 
sions. As a result, such Monte Carlo programs are often said to 
use the "leading-log approximation." 
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as well as virtual exchange diagrams, are neglected as 
higher-order effects in this approximation. 

Refs. [20] and [f6] show that parton shower Monte 
Carlo programs, which include sizeable contributions 
from the higher-order b production mechanisms of flavor 
excitation and gluon splitting, are able to better repro- 
duce the observed b production cross section. Studying 
bb correlations provides a way to tell whether such large 
contributions from these higher-order processes are sup- 
ported by the data. 

In this paper, we present two new CDF measurements 
of the A0 spectrum in bb production in pp collisions at 
=1.8 TeV. These measurements are made using ap- 
proximately 90 pb~^ of data collected during the 1994- 
1995 Tevatron run (known as Run lb). In addition to 
providing new information about the entire range of the 
bb A(j) spectrum, these analyses are more sensitive than 
previous measurements to the low A(f) region, where fla- 
vor excitation and gluon splitting make a larger contri- 
bution. 

One analysis begins with a sample of events containing 
an 8 GeV electron or muon to enhance the b quark con- 
tent of the sample by taking advantage of the relatively 
high semileptonic B branching ratio. These events are 
then searched for the presence of displaced secondary ver- 
tices indicating the decay of a long lived B hadron, using 
a vertexing algorithm similar to the SECVTX algorithm 
used for the top quark analyses [21, 22]. This analysis re- 
quires that the decay vertices for both B hadrons in the 
event be reconstructed and extracts the B hadron A(f> 
distribution from the A(f> distribution measured between 
the reconstructed secondary vertices. The direction of 
each B hadron is inferred using the vector sum of the 
momenta from the secondary vertex tracks and A(f) is 
defined as the azimuthal angle between the inferred di- 
rections of the two B hadrons. This technique yields a 
high-statistics sample of double-tagged bb events and re- 
tains sensitivity to bb pairs with small opening angles. 
The second analysis detects the presence of b quark de- 
cays in the data entirely through leptonic signatures. The 
decay of one b is tagged by reconstructing the decay of a 
J/ip ^ which provides the trigger signature that 

defines this sample. Events are also required to contain 
an electron or muon consistent with the semileptonic de- 
cay of the second b. This approach does not yield as many 
double-tagged events as the first, but it retains the high- 
est sensitivity for bb production at small opening angles 
and has fewer backgrounds. Both analyses produce con- 
sistent results indicating that roughly one fourth of the 
bb pairs produced in the momentum and rapidity range 
to which these analyses are sensitive have A<j) < 90°. In 
addition, both analyses are at least qualitatively consis- 
tent with the contribution from higher order bb predicted 
by Pythia and Herwig, further supporting the signifi- 
cance of the flavor excitation and gluon splitting produc- 
tion mechanisms at the Tevatron. 



II. DETECTOR 

The CDF detector has a cylindrical symmetry about 
the beam-line, making it convenient to use a cylindri- 
cal coordinate system with the z-axis along the proton 
beam direction. We define r to be the distance from 
the beam-line and 4> to be the azimuthal angle measured 
from the direction pointing radially outward in the plane 
of the Tevatron ring. It is also useful to use the polar 
angle 6 measured with respect to the z-axis, and pseu- 
dorapidity ry = —In (tan (6/2)). In the approximation of 
massless particles, the pseudorapidity equals the rapidity 
y = (l/2)ln {{E +pz)/{E — p^)), which is the invariant 
boost of the particle along the z-axis. The CDF detector 
is described in detail elsewhere [23] . In the following, we 
focus on the elements most relevant to these analyses. 



FIG. 2; Schematic of a quarter cross-section of the CDF Run 
lb detector. 

The tracking system, consisting of three different sub- 
detectors, the central tracking chamber (CTC) [24], the 
vertex detector (VTX), and the silicon vertex detector 
(SVX') [25], is immersed in a uniform 1.4 T solenoidal 
magnetic field in order to measure the charged particle 
momentum in a plane transverse to the z-axis, denoted 
as pt — P sin 9. A charged track reconstruction be- 
gins with the measurements made in the CTC, which 
is a large cylindrical multi-wire drift chamber in 3.2 m 
length along the z-axis and centered at the nominal in- 
teraction point of the CDF detector. It contains a total 
of 84 layers of wires positioned between r = 31 and 133 
cm. The layers are arranged in the alternating groups of 
12 with wires parallel to the z-axis, known as axial su- 
perlayers, and 6 with wires in ±3° stereo angles, known 
as stereo superlayers. The VTX, sitting inside the in- 
ner radius of the CTC, is a time projection chamber that 
provides a precise particle trajectory measurement in the 
r—z plane and ultimately allows the determination of the 
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z-location of the primary interaction point. The inner- 
most system is the SVX' covering from r — 2.9 to 8.1 cm. 
This four-layer detector aUows high precision determina- 
tion of particle trajectories in the r— </> plane. Combined, 
the whole tracking system provides a pt resolution of 
Spt/pt = [(0.0009 X pt)^ + (0.0066)2)] 1/2 .^^^ impact 
parameter resolution of ScIq = [13 + (40 GeV/c)/j;T] /^ni. 

The central electro-magnetic calorimeter (CEM) [26], 
located outside the radius of the CTC and segmented 
in a projective tower geometry, is designed to be deep 
enough to contain electro-magnetic showers initiated by 
electrons or photons. The CEM consists of alternating 
layers of lead absorber and polystyrene scintillator. A 
set of wire and strip tracking chambers, known as CES, 
are embedded in the CEM near the shower maximum 
or depth of greatest energy deposition, to measure the 
transverse shower profile. An electron is identified from a 
track reconstructed in the tracking system that points to 
an energy deposition in the CEM of appropriate size and 
matches to a cluster in the CES. For more penetrating 
particles, the central hadronic calorimeter (CHA) [27] is 
located behind the CEM. The CHA is constructed from 
alternating layers of steel absorber and scintillator, and 
also segmented in a projective tower geometry. The CHA 
is used in these analyses primarily to reject hadrons that 
might fake an electron or a muon signature. 

Muons are detected by their ability to penetrate the 
material in the calorimeter. Three sets of chambers are 
positioned outside the CHA to identify muons. The first 
set, known as the central muon (CMU) [28] is located at 
r = 3.47 m. A particle traveling perpendicular to the z- 
axis from the primary interaction point must traverse 5.4 
pion interaction lengths of material to reach the CMU. 
An additional set of chambers, the central muon upgrade 
(CMP) [29] is arranged in a rectangular array around 
the CMU behind an additional 60 cm of steel shield- 
ing to provide further discriminating power between real 
muons and hadronic punch-through. To penetrate to 
the CMP, a particle traveling perpendicular to the 2;-axis 
from the primary interaction point has to pass through 
8.4 pion interaction lengths of material. The CMU and 
the CMP detectors cover |?7| < 0.6. Another set of cham- 
bers, the central muon extension (CMX), consisting of 
four arches of drift chambers located behind 6.2 pion in- 
teraction lengths of material, covers 0.6 < jryj < 1.0. In 
addition, the CMX drift tubes are sandwiched between 
two layers of scintillator that provide fast timing infor- 
mation to the trigger. Segments reconstructed from hits 
in the chambers are known as "stubs" . 

The CDF uses a three-level trigger system. The first 
two levels, named Level-1 and Level-2, are implemented 
in hardware and reduce the data rate from the full 300 
kHz beam crossing rate to a more manageable 20 Hz. The 
third level, named as Level-3, consists of software algo- 
rithms that run a stream-lined version of the full CDF 
reconstruction software. The triggers used for these anal- 
yses rely on Icpton identification through matching en- 
ergy deposition in the CEM (for electron) or muon hits 



in the CMU, the CMP, and the CMX (for muon) with 
charged particle tracks reconstructed in the CTC. 



III. SECONDARY VERTEX TAG B HADRON 
CORRELATION ANALYSIS 

A. Overview 

The Afp distribution of two reconstructed secondary 
vertex tags has been obtained from data as a probe to 
investigate the bb production mechanisms and compared 
to the predictions based on Pythia and Herwig Monte 
Carlo (MC) programs. We correct our data for detector 
effects and background contributions using MC informa- 
tion in order to extract the At/) distribution of B hadrons 
that can be directly compared to the theoretical predic- 
tions. We choose to measure the A(f) distribution of B 
hadrons rather than b quarks, since our secondary ver- 
tex tags are more directly related to B hadrons than b 
quarks. Converting our measurement from the B hadron 
level to the b quark level would introduce a dependence 
on b quark fragmentation models that we wish to avoid. 

This analysis uses the largest sample of double-tagged 
B hadron decays ever collected at a hadron collider, 
extracted from the data taken by CDF during the 
1994-1995 run of the Tevatron (Run lb). To create a 
sample enhanced in b quark content, we take advantage of 
the high purity of CDF lepton triggers as well as the sig- 
nificant impact parameters of B decay daughters. Each 
candidate event is required to contain a lepton, either an 
electron or a muon, presumably coming from the semilcp- 
tonic decay of one B hadron, and the displaced secondary 
vertices of both B hadrons. After background removal, 
we obtain a sample of approximately 17,000 events. 

B. Secondary Vertex Tagging 

Our secondary vertex tagging algorithm looks for 
tracks consistent with coming from a secondary vertex, 
significantly displaced from the primary vertex, using the 
precise tracking information. This algorithm is based on 
the BVTX algorithm used for the B^ — B^ mixing anal- 
ysis [30], which is a modified version of the SECVTX 
algorithm used for the top quark analysis [21, 22]. The 
main difference between the version of the BVTX used 
here and the version used for the previous CDF analyses 
is the ability to locate more than one secondary vertex 
per jet searched. For extensive details on the BVTX and 
the modifications made for this analysis, see Refs. [30] 
and [31]. Below we summarize the secondary vertex 
finding approach. 

The secondary vertex finding begins by first locating 
the primary interaction vertex for the event using the 
precise tracking information. Next the tracks in the 
event passing quality cuts are grouped into jets using 
a cone-based clustering algorithm with a cone size of 
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AR = ^(A(^)2 + (A77)2 = 1.0. Each jet is then searched 
for the presence of one or more secondary vertices dis- 
placed from the primary. Because the secondary vertex 
finding is done on a jet-by-jet basis, this algorithm is not 
able to handle the case where the B decay products are 
contained in more than one jet. However, the relatively 
large cone-size used in this analysis was chosen to reduce 
the number of times the a B decay would span more than 
one jet. The secondary vertex finding is done in two steps 
for each jet. The first step finds secondary vertices con- 
taining at least three tracks. When the first step fails to 
find any more secondary vertices in a jet, the second step 
is attempted in which the individual track cuts arc made 
more stringent and two-track secondary vertices arc ac- 
cepted. Each secondary vertex found is required to be 
significantly displaced from the primary and not to be 
consistent with the decay of a Kg or A. 



C. Sample Selection 

This analysis starts with the data sample used for the 
measurement of time dependent B^ — 5° mixing [30], 
which is a loosely selected sample that requires each event 
to have at least an electron or a muon with pT > 8 GeV/c 
identified using the standard CDF lepton identification 
cuts [31], and at least one reconstructed secondary ver- 
tex. This sample is known as the BVTX sample, af- 
ter the name of the secondary vertex tagging algorithm 
used to create it. The BVTX sample consists of over 
480,000 electron-triggered events and over 430,000 muon- 
triggered events. 

The strategy for extracting candidates from the BVTX 
sample is as follows: Because the BVTX sample was col- 
lected with a number of different lepton triggers, we im- 
pose specific trigger requirements to ensure the electron 
and muon subsamples have comparable kinematic prop- 
erties. Next, the data sample is reprocessed by the mod- 
ified version of the BVTX algorithm (see section IIIB 
above) and each event is required to contain at least two 
secondary vertex tags. The separation L^y between each 
secondary vertex and the primary vertex in the plane per- 
pendicular to the beam-line divided by the uncertainty on 
the measurement {cL^y) is required to be L^y/cTL^y > 2. 
To reduce the chance of tagging the same B decay with 
two poorly measured tags, the 2-dimensional separation 
between the secondary vertex tags is also required to 
be \ALxy\/(7AL^y > 2. AL^y is defined to be the dis- 
tance between the two secondary vertex tags as mea- 
sured in the plane perpendicular to the beam. Each tag 
pair is required to have an invariant mass greater than 
6 GeV/c^ to reduce the chance that a tag pair results 
from a, B ^ D X decay chain. For a tag pair failing 
either the \ ALxy\/(JAL^y or the invariant mass cuts, only 
the tag with the longest 2-dimensional separation from 
the primary vertex is removed. Finally, since the trigger 
requirements for this sample assume at least one of the 
B hadrons decay semileptonically, the trigger lepton is 



required to be within a cone of AR = 1.0 of one of the 
vertices. 



D. Sample Composition 

We have isolated a high purity bb sample in sec- 
tion IIIC with small contamination from other sources. 
Table I shows the sample composition, including back- 
ground sources that make a contribution to the sample. 
We briefly summarize each background contribution be- 
low. 

A mistag happens when the secondary vertex tagging 
algorithm tries to fit a vertex from a set of tracks that 
do not physically originate from a common vertex. Due 
to errors caused by the tracking performance, it is possi- 
ble to find a set of prompt tracks that seem to intersect 
at a vertex displaced from the primary vertex. These 
vertices distort the correlation spectrum and must be re- 
moved. One way to identify mistags is by looking at 
the distribution of L^y, which is signed based on the in- 
ferred direction of the particle, namely the direction of 
the secondary vertex, relative to the primary vertex. A 
particle that seems to be moving out from the primary 
vertex at the time of decay obtains a positive Lxy, while 
a particle that seems to have been moving towards the 
primary vertex gets a negative L^y- A particle is deemed 
to be moving away from the primary vertex if the angle 
between the tag displacement vector (measured from the 
primary vertex to the secondary vertex tag) and the tag 
momentum vector is less than 90°, and towards the pri- 
mary vertex otherwise. A secondary vertex correspond- 
ing to the decay of real long-lived particle is expected to 
have a positive L^y However, the finite resolution of the 
tagging algorithm can yield a negative contribution. As 
a consequence, mistags make an L^y distribution that 
is symmetric about zero. We make use of this feature 
of mistags to subtract them statistically from the data. 
To understand better how the L^y distribution is used 
for mistag subtraction, consider the case of an analysis 
involving only single tags. Half of the total mistag back- 
ground appears in the negative L^y region. The positive 
portion of the L^y distribution contains the other half of 
the mistags, as well as real secondary vertex tags. There- 
fore, by subtracting twice the number of negative L^y 
tags from the entire sample of tags, we are left with only 
the good secondary vertex tags. For analyses such as this 
one, which considers pairs of secondary vertex tags, the 
calculation is given by 

Ngg = N++ - iV+_ + (1) 

where Nqg is the estimated number of tag pairs in which 
both tags arc legitimate secondary vertex tags, while 
Af++ is the number of tag pairs in which both tags have 

Lxy > 0, A''_| and are the number of tag pairs in 

which one tag has a positive L^y and the other has a neg- 
ative Lxy, and N is the number of tag pairs in which 
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Scenario 



Classification 



The tracks in the tag are from the same B decay (including any tracks 
from a secondary D decay) 

The tag contains random prompt tracks not associated with the decay 
of any long-lived particle 

The tracks in the tag arc from a B decay (including secondary D decay) 
that has already been tagged with other tracks. 

The tag tracks are from a prompt D decay-in other words, a D not 
associated with the decay of a B. 



Good Tag (Signal) 

Mistag (Background) 

Sequential Double- Tag (Background) 

Prompt Charm (Background) 



TABLE I: Different sources of tags and their classification as signal or background. 



both tags have L^y < 0. ConccptuaUy, in this equa- 
tion, we are using the second and third terms to subtract 
mistags from the tag pairs represented by the first term. 

However, and N y each contain a contribution from 

the case where both tags arc mistags and by subtracting 
them both, this contribution is double-counted. The last 
term in the equation corrects this. To obtain a mistag 
subtracted distribution, Eq. 1 is appUed on a bin-by-bin 
basis. 

Another possible source of background involves tag- 
ging more than one secondary vertex from a single B 
decay. These tags, known as sequential tags, are most 
likely to occur when the B decay involves the produc- 
tion of a hadron that travels a certain distance from 
the B decay vertex before itself decaying. The invariant 
mass cut of 6 GeV/c^ eliminates virtually all contribu- 
tion from this source. Although this cut does reduce the 
tagging efficiency at low opening angle, it is necessary 
to keep the sequential tag background from overwhelm- 
ing the signal in that region. This efficiency reduction is 
accoimtcd for in the Monte Carlo modeling of the data. 
It is also possible that some sequential tag pairs arise 
from tracking errors that cause tracks originating from a 
common vertex to be reconstructed as coming from two 
vertices that are very close together. The cut on the 
significance of the 2-dimensional separation between the 
tags {\ALxy\/aAL^y) eliminates nearly all these tag pairs. 
The residual contribution from sequential double tags is 
estimated in section IIIG. 

Finally, a background source of legitimate secondary 
vertices is direct cc production. In general, most D 
hadrons have a much smaller lifetime than B hadrons. 
However, those D hadrons that do live long enough to 
produce a secondary vertex capable of being tagged by 
BVTX will not be removed or accounted for by any of 
the methods mentioned above. In addition, it is possi- 
ble to have events in which multiple heavy flavor pairs, 
such as bb + cc and bb + bb, are produced. For example, 
in a flavor creation event, an additional cc pair may be 
produced through gluon splitting. In such events it is 
possible for the bb to contribute one tag and the cc to 
contribute another. Although the rate of multiple heavy 
flavor production is much lower than single bb produc- 
tion, the opportunity to tag more displaced vertices in a 
given event can provide an enhancement in tagging ac- 



ceptance, meaning such processes cannot be discounted 
outright. Our MC studies indicate that the combined 
contribution to the tag pair sample from prompt charm 
and multiple heavy flavor production is not large, roughly 
10%. The subtraction of this contribution and the asso- 
ciated systematic error are described in section IIIG. 



E. Monte Carlo Samples 

The parton shower MC programs, Pythia [17] and 
Herwig [19], are used to generate large samples of bb 
events. Because flavor creation, flavor excitation, and 
gluon splitting mechanisms do not interfere with each 
other in the parton shower model, each mechanism is gen- 
erated separately. For Pythia, the flavor creation sam- 
ples are generated as the heavy flavor production process 
using massive matrix elements for qq — > bb and gg — > bb 
diagrams. Flavor excitation and gluon splitting samples 
are produced as the generic QCD 2-to-2 process using 
massless matrix elements, and then separated from other 
QCD processes by examining the partons that participate 
in the 2-to-2 hard scattering. Three Pythia samples 
with different amounts of initial state radiation (ISR) are 
generated for each mechanism: The samples are dubbed 
low, medium, and high ISR, as explained in appendix A. 
The choice to investigate different ISR settings in Pythia 
is motivated primarily because the ISR tuning of Pythia 
was changed in the recent past based on studies of heavy 
flavor production [32, 33], and the new timing produces a 
noticeably different A(p spectrum from the previous ver- 
sion. The low ISR sample corresponds to the most recent 
ISR settings in Pythia while the high ISR sample re- 
flects the previous default settings. In principal, changes 
in the amount of final state radiation (FSR) would have 
a similar affect, but such an effect has not been studied 
here. For all three Pythia samples, the underlying event 
is tuned to match observations in CDF data [34] . On the 
other hand, because there are fewer parameters to tune, 
only one Herwig sample is generated for each mecha- 
nism. The Herwig flavor creation and flavor excitation 
samples are generated with heavy flavor production op- 
tion including massive matrix element treatments of the 
LO flavor creation and flavor excitation diagrams. As in 
Pythia, the Herwig gluon splitting component results 
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from generating all QCD 2-to-2 processes using masslcss 
matrix elements and retaining those events classified as 
gluon splitting based on the partons involved in the hard 
scattering. In addition, a small sample of cc events was 
generated using Pythia, for the purpose of evaluating 
the possible effects of residual prompt charm as a back- 
ground for this analysis. Both Pythia and Herwig gen- 
eration used the CTEQ5L parton distribution functions. 
Sec Appendix A for more information about the Pythia 
and Herwig parameters used for this analysis. For all 
samples, heavy flavor decays are handled by the CLEO 
QQ MC program [35]. Finally, to make the MC data 
resemble the actual data as closely as possible, the MC 
events are passed through a CDF detector simulation, a 
CDF trigger simulation, and the same reconstruction and 
analysis code used for the actual data. Additional details 
regarding the generation of MC samples can be found in 
Ref. [31]. 



F. Data— Monte Carlo Comparison 

After the Monte Carlo samples have been passed 
through the detector simulation as described above, the 
Monte Carlo predictions for the secondary vertex tag dis- 
tributions can be compared directly with data. Distri- 
butions involving individual tags have similar shapes for 
flavor creation, flavor excitation, and gluon splitting, and 
so these distributions can be used to check whether the 
detector simulation adequately models detector effects. 
Distributions involving tag pairs, and therefore correla- 
tions, give information about how well the Monte Carlo 
models describe bb production. 

Figure 3 shows the comparison of the trigger electron 
Px and Ex and the trigger muon between the data and 
each Monte Carlo sample. Figure 4 shows the compari- 
son of the secondary vertex tag properties in the Monte 
Carlo and data. In each case, the agreement between 
the measured spectrum from the data and the predicted 
spectra for each Monte Carlo sample indicates that the 
effects of trigger and reconstruction thresholds are ade- 
quately modeled in the simulation. In addition, examin- 
ing the Monte Carlo truth information for the B hadrons 
tagged by the analysis code allows a determination of the 
effective minimum B pt sensitivity for this analysis. For 
the B producing the 8 GeV/c trigger lepton, this mea- 
surement is sensitive only to B hadrons with a minimum 
Pt of 14 GeV/c, while the requirement that the other B 
be tagged by the BVTX algorithm sets a minimum pt 
acceptance of 7.5 GeV/c pr- 

Comparing tag pair correlations between the Monte 
Carlo samples and the data reveals whether Pythia or 
Herwig provide an adequate model of the higher-order 
contributions to bb production. This analysis focuses on 
the transverse opening angle, A0. For tag pairs, A(/> is 
defined as the angle between the pt vectors determined 
by taking the vector sum of the px from all the tracks 
involved in the tag. The A(p distribution is interesting to 
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FIG. 3: The trigger eleetron pr and Et and the trigger muon 
Pt distributions from data compared to the high ISR Pythia 
Monte Carlo sample. The comparisons of data to other Monte 
Carlo samples is similar and can be found in Ref. [31]. The 
muon trigger threshold is clearly visible in the lower plot. The 
small number of muons below the 8 GeV trigger threshold 
come from events containing a second muon that passes the 
offline selection. 



study because it is sensitive to contributions from flavor 
excitation and gluon splitting. Also, the broadness of 
the back-to-back peak in A(f> is sensitive to the amount 
of initial-state radiation present in the Monte Carlo. It 
should be noted that the shape of the Acf) distribution 
and the relative contributions from the three production 
mechanisms depend on the pt cuts placed on each of the 
B hadrons. 

There are two possible approaches to normalizing the 
relative contributions in Monte Carlo from flavor cre- 
ation, flavor excitation, and gluon splitting. Pythia and 
Herwig each provide predictions for the cross section 
of each production mechanism, and these cross sections 
can be used to normalize their contributions relative to 
one another. Alternatively, one could take the position 
that Pythia and Herwig may not correctly model the 
amount of each contribution, and the relative contribu- 
tions should be determined to provide the best match to 
data. In this analysis, both approaches are examined. 
As described in the sections below, the data is compared 
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FIG. 4: The secondary vertex tag distributions from electron 
data compared to the high ISR Pytiiia Monte Carlo sample. 
Comparisons involving muon data and comparisons to other 
Monte Carlo samples are similar and can be found in Ref. [31]. 

to the Monte Carlo predictions in two ways. First, the 
Monte Carlo prediction for the cross section of each pro- 
duction mechanism is used to normalize the flavor exci- 
tation and ghion splitting components relative to the fla- 
vor creation contribution. In this "flxed normalization" 
scheme, the data is compared to the Monte Carlo using 
one arbitrary global normalization parameter. The arbi- 
trary global normalization is included because this anal- 
ysis attempts only a shape comparison, not an absolute 
cross section measurement. In addition, the Monte Carlo 
and data are compared using a "floating normalization" 
scheme. In this comparison, each production mechanism 
is given an independent arbitrary normalization constant 
and the three normalizations are varied to yield the best 
match to data. 

Figure 5 shows the comparison of the BVTX tag A(/) 
distribution from data to the distribution predicted by 
each Monte Carlo sample when the relative normaliza- 
tion of each production mechanism is based on the Monte 
Carlo prediction for the cross sections of the different pro- 
duction mechanisms. From these /S.cf) comparisons it can 
be seen that each Monte Carlo model matches the qual- 



itative features of the data, although there are definite 
differences in shape, as reflected by the poor values. 
For the Pythia sample with low initial-state radiation 
(ISR), the peak in the back- to-back region is too nar- 
row, while for the medium and high ISR samples, the 
back-to-back peak is too broad. Similarly, the Herwig 
Monte Carlo sample also has a peak that is too broad at 
high A(/), perhaps even more so than in Pythia. How- 
ever, aside from these discrepancies at high A(/), the rest 
of the A0 distribution matches reasonably well between 
Monte Carlo and data using the normalizations predicted 
by the Monte Carlo generators for the different produc- 
tion mechanisms. The values between the A(/) curves 
from Monte Carlo and data are listed in Table II. On 
the basis of these values, it appears that Pythia 
with the medium ISR value provides the best match to 
data when using the Monte Carlos default normalization 
for the three production mechanisms. Figure 6 shows 
the breakdown of the contributions from the individual 
production mechanisms to the overall A^ shape for this 
Pythia sample. 

However, since, in the parton shower approximation, 
the contributions from flavor creation, flavor excitation, 
and gluon splitting may be generated separately, each 
component can have a separate, arbitrary normalization 
and the three components can be fit for the combination 
of normalizations that gives the best match to the shape 
of the A^ spectrum from data. These fits are shown in 
Figure 7. When the normalizations of the individual com- 
ponents are allowed to float with respect to one another, 
one can obtain rather good agreement in shape between 
data and both the low ISR and high ISR Pythia sam- 
ples. The fit of the low ISR Pythia Monte Carlo to the 
data increases the broader contribution from fiavor exci- 
tation to compensate for the narrowness of the back-to- 
back peak from flavor creation. For the high ISR Pythia 
samples, the peak at high Aq!) is made narrower to match 
the data by all but eliminating the contribution from fla- 
vor excitation. A comparison of the relative fractions 
of each production mechanism in the two Pythia flts is 
shown in Figure 8. The fit of the Herwig sample to the 
data also tries to compensate for the excessive broadness 
of the Herwig flavor creation peak at high At/), but even 
after completely eliminating the flavor excitation contri- 
bution, the remaining contribution from flavor creation 
at high A^ is too broad to model the data. Table II 
compares the fit quality and effective contribution from 
fiavor creation, flavor excitation, and gluon splitting in 
the fits of the various Monte Carlo samples to the data. 
Both low ISR and high ISR Pythia samples can be made 
to fit the data with approximately the same fit quality, 
which is unexpected, especially since the low ISR sample 
accomplishes this fit with a high fiavor excitation content 
while the high ISR sample fits with almost no fiavor ex- 
citation contribution. In the end, there seems to be an 
ambiguity in Pythia that allows a trade-off between ini- 
tial state-radiation and the amount of flavor excitation. 

In general, any of the Monte Carlo samples com- 
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TABLE II: Comparison of the effective contributions from flavor creation (FC), flavor excitation (FE), and gluon splitting (GS) 
to fits of the Monte Carlo A0 to the data. The fit takes into account Monte Carlo statistics in addition to errors on the 
data. The "x^ probability" entry refers to the probability of getting a worse fit, according to the x^ distribution. 



pared to the data shows reasonable qualitative agree- 
ment. The Monte Carlo sample that best matches the 
data is Pythia with medium or high ISR settings, when 
the individual normahzations of the flavor creation, flavor 
excitation, and gluon sphtting are aUowed to float sepa- 
rately to best fit the data. Although the fit using Pythia 
with low ISR is not so poor as to rule this model out com- 
pletely, studies indicate that Pythia with high initial 
state radiation does a better job of matching both the 
underlying event and minimum bias data at CDF [34]. 
Therefore, we select the Pythia sample, with high ISR 
and the relative normalizations of flavor creation, flavor 
excitation, and gluon splitting fixed by our fit to the A(/) 
distribution of the data, as the best Monte Carlo model 
of the data. Comparisons indicate that the differences 
between Pythia with medium or high ISR settings are 
minor. Figure 9 shows a comparison of other correlations 
between the data and Pythia with high ISR. Although 
these plots show good agreement between the data and 
Pythia for the overall shapes of the distributions, the 
shapes of the individual contributions from flavor cre- 
ation, flavor excitation, and gluon splitting arc not dis- 
tinct enough to allow a separation of the components as 
was done for the distribution. 

It is interesting to note that before allowing the nor- 
malizations of each production mechanism to float in the 
fits, the agreement between Herwig and the data is no 
worse than the agreement between the low ISR Pythia 
sample and the data. However, because the disparity be- 
tween the data and the low ISR Pythia sample comes 
from the narrowness in the flavor creation peak at high 
A</), when the normalizations are allowed to float, the 



fit can alleviate the disagreement by increasing the peak 
width through a higher contribution from flavor excita- 
tion. In contrast, for Herwig, once the contribution 
from flavor excitation has been reduced to zero, the fit 
has no way to make the width of the back-to-back flavor 
creation peak smaller, short of the unphysical situation 
of setting the flavor excitation normalization negative. 
If there were some other parameter for Herwig, like 
Pythias initial state radiation parameter, PARP(67), 
that could be used to tune the width of the back-to-back 
flavor creation peak, it may be; possible to achieve good 
agreement between Herwig and the data as well. 



The results presented here can be compared to another 

analysis of lepton tags in heavy-flavor events presented 
in Ref. [16]. That analysis compares Herwig to double- 
tagged events using higher Et jet samples. In addition to 
using a sample of double-tagged events at higher momen- 
tum, Ref. [16] also differs from this analysis in that it uses 
calorimeter based jets as opposed to the tracking jets uti- 
lized here, and the quantity measured is the azimuthal 
opening angle between the tagged jets rather than the 
angle between the tags themselves. In agreement with 
this analysis, that one clearly shows the importance of 
the higher-order contributions in heavy flavor produc- 
tion, and also shows an agreement with Herwig that is 
better than the agreement seen in this analysis. Perhaps 
this suggests that the disagreement shown here between 
Herwig and the data is related to Herwigs ability to 
model low pt ^ production or h fragmentation. 
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FIG. 5: Comparisons between the shape of the tag A<j) distri- 
bution for data (points) and the tag /S.<j) distribution predicted 
by the various Monte Carlo samples (line) . The contributions 
from flavor creation, flavor excitation and gluon splitting are 
added together according to the individual Monte Carlo cross 
section predictions for these contributions and only a single 
common normalization is varied to get the best match to data. 
The values shown in the plots account only for the data 
and Monte Carlo statistical errors. 



G. Corrections and Systematics 

The correlations examined so far in the data involve 
pairs of BVTX tags, rather than pairs of B hadrons. 
There are detector effects, such as the tagging efficiency 
for pairs of B hadrons as a function of At/), that distort 
the shape of the measured tag pair correlations from the 
true B hadron distribution. In addition, residual contri- 
butions from background can affect the shape of the tag 
pair distribution. For the comparison between MC and 
data, the detector effects are accounted for by using a de- 
tector and trigger simulation to adjust the MC to match 
the conditions in the data, while the backgrounds are as- 
sumed to be negligible. However, since the MC models 
examined in section III F match the data reasonably well, 
MC events can be used to determine the relationship be- 
tween the measured tag pair distribution and the actual 
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FIG. 6: A detailed comparison between the /S.<j) distribution 
from data (points, statistical errors only) and the A(/!> distribu- 
tion from Pythia with medium ISR (solid line). In addition, 
the contributions from flavor creation (dashes), flavor excita- 
tion (dots), and gluon splitting (dash-dots) are shown. The 
contributions are normalized according to Pythia's cross sec- 
tion predictions and an arbitrary global normalization is used 
to give the best shape fit between data and Monte Carlo. 

Note that mistag subtraction applied to the individual 
Pythia contributions can result in negative values for bins 

with few entries. Consequently, the total Pythia 
distribution can be less than one of the components in some 

bins. 

B hadron distribution. In the sections below, two kinds 
of corrections to the tag pair A(/) distribution are con- 
sidered: a correction for the relative tagging efficiency, 
which is a detector effect, and a correction for the contri- 
butions from mistags, prompt charm, and sequential de- 
cays that remain in the data after the steps taken in sec- 
tion III D to remove backgrounds. In addition, the MC is 
used to estimate the systematic uncertainties associated 
with correcting for the relative tagging efficiency and re- 
moving background events. These corrections and sys- 
tematic errors are evaluated using several different MC 
samples to account for uncertainties involved in the MC 
model itself. 

The BVTX tagging algorithm is not equally effective 
for all topologies of bb production. In particular, it be- 
comes more difficult for the BVTX algorithm to recon- 
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FIG. 7: Comparisons between the shape of the tag distri- 
bution for data (points) and the tag /S.<j) distribution predicted 
by the various Monte Carlo samples (line). In these compar- 
isons, the normalizations of each production mechanism were 
allowed to vary independently and were chosen to give the 
best fit between the Monte Carlo and the data. Again, the 
fit takes into account Monte Carlo statistics in addition to 
errors on the data. 



struct both displaced secondary vertices as the opening 
angle between the two B hadrons decreases. This ef- 
fect becomes especially severe when the two B hadrons 
are both contained within the cone of a single jet for 
track clustering purposes. Furthermore, correlations be- 
tween opening angle and pt{B) for the various produc- 
tion mechanisms can lead to differences in the relative 
efficiency for reconstructing tag pairs at different open- 
ing angles. These effects distort the shape of the A(/) 
distribution measured for tags from the true BB A0 dis- 
tribution. 

We correct for these relative efficiency effects using the 
MC that best matches the data, as determined in sec- 
tion III F. Because we are only examining the shape of 
the A(/) distribution, our goal in making this correction 
is only to account for differences in the relative efficiency 
of the tagging algorithm, as a function of A0. We do not 
attempt to correct for effects that impact all parts of the 
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FIG. 8: A comparison of the contributions from flavor cre- 
ation (dashes), flavor excitation (dots), and gluon splitting 
(dash-dots) to the total Acp shapes (solid) for Pythia with 
high ISR (top) and low ISR (bottom). Electron Monte Carlo 
is shown in the plots. The muon plots can be found in 
Ref. [31]. The normalization of each component is set by 
the best fit of the three components to the spectrum from 
data. Note that mistag subtraction applied to the individual 
Pythia contributions can result in negative values for bins 
with few entries. Consequently, the total Pythia distribu- 
tion can be less than one of the components in some bins. 

A(f) spectrum equally. For example, an overall shift in the 
muon trigger efficiency would not affect this correction. 
To determine the correction for each bin we take the ra- 
tio of the number of tag pairs reconstructed in the MC to 
the number of pairs that could have been reconstructed 
if the tagging algorithm had perfect efficiency. The num- 
ber of tag pairs that would have been reconstructed as- 
suming perfect efficiency is determined by looking at the 
generator level B hadron Ac/) distribution. For electron 
MC, to simulate the electron trigger, we require one B 
hadron in the event to have a pt > 14.0 GeV/c and 
1 77 1 < 1.0. For the muon MC, we demand one B hadron 
with Pt > 14.0 GeV/c and \r]\ < 0.6. For both cases, 
we require a second B hadron with pT > 7.5 GeV/c and 
|?7| < 1.0. The cuts placed on the generator-level MC 
were determined by examining the px and rj distributions 
for B hadrons from MC events in which two BVTX tags 
were reconstructed. The px and t] values were chosen by 
determining the cuts for which 90% of the B hadrons in 
the double-tagged MC events would pass. We take the 
A(j) distribution resulting from the event selection above 
and convolute it with a Gaussian resolution function with 
a width of 0.1086 radians, characteristic of the A^ res- 
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FIG. 9: A comparison of Pythia with high ISR to the data for 
several different correlation quantities. The normalizations 
for the three production mechanisms in Pythia have been 
determined by the fit of the Pythia A(j> distributions to data. 
The pt asymmetry is given by Apj, = (prilep) — pT(non — 
lep) / (prilep) + prinon — lep) . In the pT^Non — LepTag) 
plot, the sign of the pt is determined by the opening angle 
between the lepton-tag and the non-lepton tag: negative for 
tag pairs with At/) < 90°, positive otherwise. The data are 
shown as points with statistical error bars only. The solid line 
is Pythia with high ISR 



case of finding secondary vertex tags in two separate jets 
to finding secondary vertex tags in the same jet. Since 
we are only interested in the effect of the efficiency on 
the shape of the A0 distribution, and not on its absolute 
normahzation, we have rescalcd the curve in Fig. 10 so 
that the relative efficiency in the last A0 bin is defined to 
be unity. Thus this curve shows the effect of the BVTX 
tagging efficiency for a given bin relative to the last A(/) 
bin. 

PYTfflA (CTEQ5L), High ISR, Fit Normalizations 
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FIG. 10: The bin-by-bin values for the relative efficiency re- 
turned by the fit. The curves have been normalized so that 
the last A(j) bin has a value of one by definition. The error 
bars on these curves indicate the statistical error on the bin 
values returned from the fit. The statistical errors for the fit 
are correlated from bin to bin. 



olution of the BVTX tagging algorithm as measured in 
MC. 

In order to minimize the effect of statistical fluctua- 
tions in the tagging efficiency determined from MC, we 
fit the tagging efficiency to an empirical function of the 
following form: 

Afreq (^^^) + PsiAcb) + Pg, (2) 

where freq is the error function. The relative efficiency 
curve resulting from this fit is shown in Fig. 10. The 
sharp step around = 60°, which is modeled by the 
error function term, comes from the transition from the 



There are two main contributions to the systematic 
uncertainty associated with the relative tagging efficiency 
correction. First, the statistical errors on the fit value for 
the relative efficiency correction factor should be propa- 
gated into systematic uncertainties on the corrected Acj) 
distribution. There is an additional systematic uncer- 
tainty that comes from the model used to calculate the 
relative efficiency correction. The Pythia MC sample, 
with high ISR and with the normalization of the differ- 
ent production mechanisms taken from the fit to the At/) 
distribution in the data, is used as our baseline for the 
relative efficiency correction. However, other models, like 
the lower ISR Pythia sample or Herwig also match the 
data to varying degrees and so could also have been used. 
To account for this ambiguity, we compare the relative ef- 
ficiency corrections from other MC models to our baseline 
model. In the worst case, the difference for the bin-by- 
bin relative efficiency correction factor is approximately 
equal in magnitude to the statistical error from the fit. 
Therefore, to account to modeling uncertainties in the 
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relative efRcicney correction, we increase the systematic 
error associated with the correction by a factor of \f2 . 

The mistag subtraction scheme used for this analysis 
relies on the assumption that 100% of legitimate tags and 
50% of mistags have positive L^y The true fraction may 
be somewhat different. For example, if most of the events 
contain at least one B hadron, then the L^y distribution 
of mistags may be biased towards positive values by the 
presence of actual displaced tracks in the events. Fur- 
thermore, the bias in L^y may depend on the topology 
of the event. To investigate any possible bias in the L^y 
distribution of mistags, we examined MC events contain- 
ing mistags identified by matching tracking information 
to MC truth information. Prom MC sample to MC sam- 
ple, the fraction of legitimate secondary vertex tags that 
have positive Lj;^ varies from 0.97 to 1.0. For mistags, 
the positive L^^y fraction varies from 0.45 to 0.55. To 
estimate the possible effect of using the wrong fractions 
when performing mistag subtraction, we redo the mistag 
subtraction in the data using different assumptions about 
the positive L^y fraction for good tags and mistags. The 
mistag subtraction formula (Eq. 1), generalized for an 
arbitrary fraction j) of good tags with positive Lj;^ and 
an arbitrary fraction q of mistags with positive L^y, is 
given by 
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Changing the positive L^y fractions from mistag sub- 
traction affects both the normalization and the shape of 
the A(j) distribution. However, we are only concerned 
about the shape for this analysis. Therefore, before we 
compare the shape of the Acj) distribution using the stan- 
dard mistag subtraction scheme to the shape obtained 
using alternative values for the positive L^y fractions, we 
normalize the distributions to imit area. To estimate the 
systematic error from mistag subtraction, we take the 
A<p distributions calculated varying the p and q values 
in Eq. 3 within their allowed ranges and fit them to the 
functional form for the A(p distribution, given below, in 
order to minimize the effect of statistical fluctuations: 



/(A0) = Piexp 
P3 exp 



1/A0\2 
P4(A(/.-7r)2+P5(A(/.) 



PqA^ + Pr 



(4) 



We then calculate the maximum deviation between the 
result from the default mistag subtraction scheme and 
the results obtained from varying the positive L^y frac- 
tions. This maximum deviation is assigned as the sys- 
tematic error on the A0 shape from mistag subtraction. 



The bin-by-bin contribution to the double-tag A(p dis- 
tribution from prompt charm is estimated primarily us- 
ing MC. The overall amounts of prompt charm and bb+cc 
double tags are estimated by comparing the relative rate 
of obtaining a double-tagged bb MC event to the rates for 
double-tagging cc and bb + cc MC events. This approach 
estimates that 2.9% (6.0%) of the tag pairs in this sam- 
ple come from cc production for electron (muon) data, 
and 1.8% of the tag pairs in both the electron and the 
muon samples comes from bb + cc production. The A(p 
shape for the cc and bb + cc contributions is estimated 
by applying the measured relative tagging efficiency as 
a function of A(j) to the generator level cc and bb + cc 
A(j) distributions. The resulting estimated contamination 
from prompt charm to the double-tag Acj) distribution 
is shown in Fig. 11. The systematic error on this cor- 
rection is estimated by performing several checks on the 
data. One check involves comparing the Acj) spectrum for 
double-tagged events in which the invariant mass of the 
tracks for each tag is greater than 2 GcV/c^ to the spec- 
trum when both tags have an invariant mass less than 
2 GeV/c^. The former sample is enhanced in bb content 
relative to prompt charm, while the latter sample has 
a greater contribution from prompt charm. Both sub- 
samples have far fewer statistics than the main sample. 
The Acj) shapes of these two subsamples agree within the 
statistics of the samples, suggesting a negligible contri- 
bution from prompt charm. An alternative estimation of 
the prompt charm contribution can be obtained by fitting 
the tag mass distribution to template shapes derived from 
bb events (including tags of secondary charmed mesons) 
and cc events. The results of these fits suggest a prompt 
charm contamination roughly a factor of two larger than 
the MC estimates, although still a relatively small con- 
tribution at 7.1% for the electron data and 13.3% for the 
muon data. As a result of the differences between these 
two alternate estimates of the prompt charm contribu- 
tion and the MC method used to set the normalization 
of our prompt charm correction, we set the systematic 
error on the prompt charm correction equal to the size 
of the correction in each bin. 

The MC is also used to determine the residual contri- 
bution from sequential double tags. Based on examining 
MC events in which two tags are identified to come from 
the same B decay, we determine that after mistag sub- 
traction, 25.9% of the tags removed by the 6 GeV/c^ 
mass cut were from sequential tag pairs. Furthermore, 
using Monte Carlo it was also determined that for every 
100 mistags removed by the 6 GeV/c^ mass cut, roughly 
2.41 events remained in this sample, yielding an efficiency 
for this cut of 97.6%. In the data, after mistag subtrac- 
tion, the 6 GeV/c^ invariant mass cut removes 471 tags 
from the electron sample and 598 tags from the muon 
sample. Using the numbers derived from the MC above, 
this means that of the tags removed by the invariant mass 
cut, 122.1 electron and 155.0 muon tags come from se- 
quential double tag pairs, and an estimated 2.9 electron 
sequential tag pairs and 3.7 muon sequential tag pairs 



16 



10' 



10' 



10 



1994-1995 (90^pb) 



Prompt Charm + Prompt Charm Background 
Bottom + Prompt Charm Background 



+++++++■ 



30 




Electron Data 



120 150 180 
A(|) [Degrees] 




Muon Data 



150 180 
A(|) [Degrees] 



FIG. 11: The estimated shape of the bacltground from direct 
cc production (the hatched area) and multiple-heavy flavor 
{bb + bb and bb + cc) production (the solid area). The points 
with error bars show the data. 



remain in the data after this cut. The Ac/) distribution 
of the sequential tag pairs is also determined using MC 
to be well described by the positive half of a Gaussian 
distribution with a mean of zero and a width of 0.122 
radians. To correct for the sequential double tag con- 
tribution in the data, we take the estimated number of 
sequential double tags, with a half-Gaussian distribution 
as described above, and subtract them from the Acj) bins 
in the data. The systematic error on this correction is 
set equal to the size of the correction. 



H. Final Distribution and Comments 



a Gaussian resolution function to account for our Acj) 
resolution. Finally, ignoring the small difference in ry ac- 
ceptance between the electron and muon samples, these 
two distributions can be combined to give the overall B 
hadron Acj) distribution, shown in Figure 13. Table III 
specifies the corrected fraction in each Acj> bin as well as 
the breakdown of the systematic errors for each bin. 
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Figure 12 shows the final, corrected tag Acf) distribu- 
tion, including systematic errors. To obtain this distri- 
bution, the contributions from residual sequentials and 
prompt charm are removed from the mistag-subtracted 
distributions. Then the relative efficiency corrections de- 
rived in Section III G are applied. Systematic errors from 
the various corrections are combined in quadrature to 
give the total systematic error. Mistag subtraction gives 
the largest contribution to the systematic error. The final 
corrected tag distribution provides a measurement 
of the B — B Acj) distribution where the B providing 
the trigger electron (muon) has pr > 14.0 GeV/c and 
I??! < 1.0(0.6), and the other B has pr > 7.5 GeV/c and 
I77I < 1.0, with a Acji resolution of 6.22°. This distri- 
bution can be compared to generator-level Acj> distribu- 
tions from Monte Carlo that have been convoluted with 



FIG. 12: The final, corrected Acj) distribution for the double- 
tagged electron (left) and muon (right) data. The corrections 
made to the data include mistag subtraction, sequential re- 
moval, prompt charm subtraction, and the relative tagging 
efficiency correction. The error bars display the statistical er- 
ror on the points. The filled region at the bottom indicates 
the systematic errors. The systematic errors are correlated 
from bin to bin. Mistag subtraction provides the dominant 
contribution to the systematic errors. 

From the corrected data, we can also calculate the 
fraction of tag pairs in the "towards" region, defined 
by Acj) < 90° . This fraction is of interest because 
production in the "towards" region is dominated by the 
higher order production diagrams. The towards fraction 
provides a single figure of merit to indicate the relative 
sizes of the contributions from flavor excitation and gluon 
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TABLE III: The corrected fraction of combined electron and muon data in each bin as well as a breakdown of the components 
of the systematic errors on each bin. The total systematic error is the sum in quadrature of the individual components. 



splitting. To account for correlated systematic errors, we 
calculate the tovi^ards fraction for our data by essentially 
repeating the analysis with two bins instead of thirty, 
and then taking the ratio of the "towards" bin over the 
total. For the electron data, we obtain a towards fraction 
of 29.8 ± l.i{stat.) ± 2.%syst.)%. For muon data, we ob- 
tain a towards fraction of 26.4 ± 1.7{stat.) ± 3.7(syst.)%. 
The electron and muon samples are combined to give a 
towards fraction of 28.8 ± l.Q{stat) ± 3.1(sysi)%. Ta- 
ble IV shows the uncorrected number of tag pairs in the 
"towards" and "away" bins in the data and gives the 
corrections applied to obtain the final number. Table V 
breaks down the contributions to the systematic uncer- 
tainty on the towards fraction. 



IV. J/V^-LEPTON 6 QUARK CORRELATION 
MEASUREMENT 

A. Overview 

This measurement is optimized to measure the region 
in phase space least understood in experimental measure- 



ments and theoretical predictions: small A(/) where both 
bottom quarks point in the same azimuthal direction. As 
stated previously, earlier bottom quark angular produc- 
tion measurements had little sensitivity to this region. 
A study of opposite side flavor tags using soft leptons 
for the CDF sin 2/3 measurement [36] showed a signif- 
icant number of tags at small opening angles between 
fully reconstructed bottom decays and the soft leptons. 
Figure 14 shows the sideband-subtracted A(/) distribution 
between J/tpK~^ candidates and the soft leptons. 

About 30% of the soft leptons are in the same azimuthal 
hemisphere, a fraction much larger than expected from 
parton shower flavor creation Monte Carlo (« 5% for 
Pythia flavor creation). 

This analysis uses the bottom pair decay signature of 
b — + J/ipX,h t^X. The impact parameter of the 
additional lepton and the pseudo-cr of the J/i/' are fit 
simultaneously in order to determine the 66 fraction of 
the two A(/) regions. Angular requirements that were 
necessary in previous di-lepton measurements because of 
double sequential semi-leptonic decay backgrounds(6 — » 
ci~X\ c — > f+X') are avoided by the chosen signal. is 
the only particle that decays directly into J ji]) and an ad- 
dition lepton. The only other source of candidates where 
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Electrons 




Muons 






Towards 


Away 


Towards 


Away 


Mistag-Subtracted Data 


1210 


8887 


832 


6260 


Charm Contamination 


42.1 


442.6 


52.9 


500.3 


Sequential Contamination 


2.9 


0.0 


3.7 


0.0 


Relative Efficiency Correction Factor 


0.326 


1.0 


0.376 


1.0 


Corrected Data 


3573.6 


8444.4 


2062.2 


5759.7 



TABLE IV: The number of events in the "towards" and "away" regions before and after applying corrections to the data. 





Electrons 


Muons 


Towards Fraction 


29.8% 


26.4% 


Statistical Error 


±1.3% 


±1.7% 


Mistag Subtraction Systematic Error 
Sequential Removal Systematic Error 
Charm Subtraction Systematic Error 
Relative Efficiency Correction Systematic Error 


±2.0% 
±0.05% 
±1.3% 
±1.6% 


±2.8% 
±0.09% 
±2.2% 
±1.1% 


Total Systematic Error 


±2.9% 


±3.7% 



TABLE V: The break-down of the systematic errors by contribution. The total systematic error is the quadrature sum of the 
individual components. 
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O0.125 - Fraction with A(b < 90": 

28.8 ± 1.0 (Stat.) ± 3.1 (syst)% 

Trigger B: p.j, > 14 GeV/c, |ti| < 1 
Otlier B: p.j, > 7.5 GeV/c, |ri| < 1 
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FIG. 13: The combined, corrected electron and muon 
distribution from the double-tagged analysis. In making this 
plot, we ignored the difference in 77 acceptance between the 
electron and muon triggers. The corrections made to the 
data include mistag subtraction, sequential removal, prompt 
charm subtraction, and the relative tagging efficiency correc- 
tion. The error bars display the statistical error on the points. 
The filled region at the bottom indicates the systematic er- 
rors. The systematic errors are correlated from bin to bin. 
Mistag subtraction provides the dominant contribution to the 
systematic errors. 



FIG. 14: Sideband-subtracted A0 distribution between fully 
reconstructed — > J/ipK^ and soft leptons. 



the additional lepton and J candidates originate from 
the same displaced decay are hadrons that fake leptons or 
decay-in-flight of kaons and pions. The number of events 
from Be or from 'fake' leptons can be estimated well by 
using techniques from Ref. [37]. Thus, no angle require- 
ment between the two candidate bottom decay products 
are necessary, yielding uniform efficiency over the entire 
A^'''' range. Due to the limited size of the data sam- 
ple, only ftoward, the fraction of hh pairs in the same 
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azimuthal hemisphere, can be measured. 

The selection criteria used in this analysis have simi- 
lar bottom momenta and rapidity acceptances to CDF's 
Run II displaced track(SVT) [38] and J/i/" triggers, and 
the addition leptons have momenta very similar to the 
opposite side taggers planned for Run II (opposite kaon, 
opposite lepton and jet charge flavor taggers). Therefore, 
this measurement aids in the development and under- 
standing of flavor taggers for such Run II measurements 
as the Bs mass difl'erence. 



B. Sample Selection 

The signal searched for in this analysis is b ^ 
J/ijjX,b t^X' where £ can be an electron or muon. 
In this section, the Run IB J/il) data set is described. 
The offline selection criteria for both the J/V' and the 
additional lepton are also described. 

1. J/ip — > iJi^ ix~ Selection 

This analysis uses the CDF Run lb J/il> data set ob- 
tained between January 1994 and July 1995. The CDF 
J /ijj triggers and offline J/'4> selection criteria utilized are 
the same as the Be discovery analysis at CDF [37]. In or- 
der to understand the trigger efficiencies, we confirm that 
the J/'f/' candidate's muons are the two muon candidates 
which triggered the event. 

After confirming the trigger, the position of the ex- 
trapolated track at the muon chamber is compared to 
the position of the muon stub using a matching test, 
taking into account the effects of multiple scattering and 
energy loss in material. The positions are required to 
match within 3 standard deviations in the i~4> projection 
and within 3.5 standard deviations in the r-z projection. 

Next, we require a high quality track for both muon 
candidates. The pseudo-lifetime (ct) of the J/ if) candi- 
dates is used in this analysis to determine the bottom 
purity. Therefore, SVX' information is required in order 
to improve the precision of the cr measurement. 

In order to reject J/ijj candidates with muons origi- 
nating from different primary interactions, the z position 
difference between the two tracks is required to be less 
than 5 cm at the beam-line. A vertex constrained fit 
of the two muon candidates is performed [39]. The 
probability of the vertex flt is required to be better than 
1%. The vertex constrained mass of the J/ip candidate 
is required to be 2.9 GeV < Mj/^ < 3.3 GeV. 

A total of 177,650 events pass the above selection cuts. 
Figure 15 shows the J/'0 mass distribution for these 
events. In order to estimate the number of J/V' — > M^M" 
candidates, the mass has been flt with two Gaussians 
(used to model the J/ij) signal) and a linear background 
term. A linear background has been assumed in many 
previous CDF J/ij} analyses [37, 40, 41]. The back- 
ground under the mass peak is caused by irreducible 



decay-in-flight and punch-though backgrounds, DrcU- 
Yan muons and double sequential semi-leptonic decays 
where 6 c^i'X, c -* s^i+X'. From the fit, 137780±440 
J /if) candidates are in the sample. For this measure- 
ment, the J/tp mass signal region is defined to be within 
±50 MeV of the Particle Data Group [42] world av- 
erage value (3096.87 MeV). The sideband regions are 
chosen to be 2.900 GeV < Mj/^ < 3.000 GeV and 
3.200 GeV < Mj/^ < 3.300 GeV. The sideband regions 
contain 20,180 events. The events in these regions are 
used later in the analysis to describe the cr shape of J/ij} 
background in the mass signal region. 

> 8000 ^ ^ 




2.9 2.95 3 3.05 3.1 3.15 3.2 3.25 3.3 
J/^ candidate mass (GeV) 



FIG. 15: Di-muon invariant mass distribution from events 
passing selection criteria. Top: Linear scale. Bottom: Loga- 
rithmic scale. 

The ratio between the number of background J/tp 
events in the mass signal region to the background in 
the mass sideband region (Rgide) was determined to be 
R^i^^ = 0.501 ± 0.000043 by the mass fit. To estimate 
the systematic uncertainty of this ratio, a 2nd order poly- 
nomial is used to describe the background term. The 
resulting fit value is Rside = 0.545 ± 0.008. The differ- 
ence between the two fits is taken to be the systematic 
uncertainty yielding Rside = 0.501 ± 0.044. 

2. CMUP n Selection Requirements 

The additional (non-J/^) muon is required to have 
muon stubs in both the CMU and CMP (a CMUP muon). 

Reqiiiring both CMU and CMP muon stubs maximizes 
the amount of material traversed by the candidate, re- 
ducing the background due to hadronic punch-though of 
the calorimeter. The matching requirements are the 
same as for the J/^ muons. The muon candidates are 
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required to have a pT > 3 GcV; muons with lower pT 
will typically range out prior to the CMP due to energy 
loss in the calorimeter and the CMP steel. 

As the impact parameter is used to estimate the bot- 
tom purity of the muons, the same track quality is re- 
quired as for the J/ip muons. Additionally, the muon 
candidate's track projection is required to be in the fidu- 
cial volume of the CMU and CMP. The z positions of the 
J/tp candidate and the CMUP muon are required to be 
within 5 cm of each other at the beam-line. 

In total, 247 CMUP candidate muons are found in the 
J/tp sample, out of which 51(142) CMUP candidates arc 
in events where the J/ candidate is in the mass sideband 
(signal) region. 

Of the 142 CMUP candidates with J/tp candidates in 
the J/tp mass signal region, 64 events have the CMUP 
muon and the J/ijj candidate in the same hemisphere in 
the azimuthal angle (which will be known as toward); the 
other 78 events have the CMUP muon and J/ ip candidate 
in the opposite hemisphere in the azimuthal angle (which 
is denoted away). 



3. Electron Selection Criteria 

A method of the finding soft (relatively low momenta) 
electrons was developed for bottom flavoring tagging in 
CDF's -Brf mixing and sin(2/3) measurements [30, 36]. 
These electrons have a relatively high purity, a trans- 
verse momenta greater than 2 GeV, and an understood 
efficiency. The rate of hadrons faking an electron was 
studied extensively in Rcf. [37], making it possible to es- 
timate the background due to hadrons faking electrons. 
The selection criteria is identical to Ref. [37] in order to 
use the same fake rate estimates. 

The selection criteria requires a high quality track 
which is consistent with an electron in the various de- 
tector systems. Information on the energy (charge) de- 
posited, the cluster location, and track matching vari- 
ables are all used in order to reduce the rate of a hadron 
in the detectors' fiducial volume faking an electron to 
(6.4 ±0.6) X 10-^. 

One source of electron background is photon conver- 
sions, where a photon interacts with detector material 
and converts into a e+e~ pair. In addition, 1.2% of all 
neutral pions decay into 7e+e~ directly (Dalitz decay). 
To reduce this background, conversions are searched for 
and vetoed by looking for a conversion partner track. 

The conversion requirements are the same as Ref. [30]. 
Unfortunately, the conversion removal is not totally effi- 
cient. Therefore, some of the soft electron candidates are 
residual conversion electrons, where either the conver- 
sion pair track is not found due to tracking inefficiencies 
at low pt or the conversion electron selection is not fully 
efficient. The rate of residual conversions is studied more 
in section IV C 4. 

In total, 514 candidate electrons are found after con- 
version removal; 92 events have the J/ tp candidate in the 



mass sidebands and 312 events have the J /ip candidate 
in the mass signal region. In the J/tp mass signal region, 
107(205) of the events are in the toward(away) regions 
in A^. In the J/tp mass signal region, 6(9) events were 
vetoed as conversions in the toward(away) bin. In 
the J/tp mass sideband region, 5(4) events were vetoed 
as conversions. 



C. Signal and Background Description 

The signal and backgrounds for both the J/tp + n 
and J/tp + e samples are very similar. The basic tech- 
nique to determine the amounts of the various signal and 
background components is with a simultaneous fit of the 
pseudo-cr (defined in section IV C 1) of the J/tp and the 
signed impact parameter of the non-J /tp lepton. The im- 
pact parameter is signed to distinguish between residual 
electron conversions and electrons from bottom decay, 
as described in section IV C 5. The impact parameter is 
signed positive if the primary vertex lies outside the r-(/> 
projection of the particle's helix fit. 

As the J / tp and additional lepton originate from sepa- 
rate bottom hadron decays, the impact parameter of the 
additional lepton and the cr of the J/ tp are not strongly 
correlated for the signal. The backgrounds in this anal- 
ysis have two categories: one in which the impact pa- 
rameter and CT are uncorrelated, and other where the 
impact parameter and ct are strongly correlated. The 
impact parameter and the ct become strongly correlated 
when both the J/tp and the additional lepton candidate 
originate from the same displaced vertex. 

In uncorrelated sources, the impact parameter and cr 
shapes describing the background are determined inde- 
pendently. J/tp candidates are assumed to originate from 
three sources: direct J/tp production (including feed- 
down from Xci, Xc2, and tp}[2s)) where the J/tp decays 
at the primary vertex, J/tp from bottom decay (includ- 
ing the feed-down from higher cc resonances), and the 
non-J/tp background described by the events in the J/tp 
mass sidebands. Lepton candidates are assumed to orig- 
inate from the following sources: directly produced fake 
or real leptons from the primary vertex, leptons from 
bottom decay (including b cX iX'), lepton candi- 
dates with the fake J/tp candidate, and residual conver- 
sion electrons. 

In addition, two correlated sources of backgrounds ex- 
ist. The first source is Be J/tjjC+X, which is a small 
but irreducible background. The impact parameter of 
the additional lepton and the ct of the J/ tp is described 
by Monte Carlo techniques and the overall size of the 
background is also estimated (see section IV C 7). 

The other correlated source of background occurs when 
a bottom hadron decays into a J /tp and a hadron which is 
misidentified as a lepton. For electrons, this background 
is due to hadrons (mostly tt^ and K"^) showering early 
in the calorimeter and passing the electron identification 
selection. For muons, there are two sources of this back- 
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ground. The largest source of correlated background is 
due to decay-in-flight of charged pions and kaons, which 
result in a real muon. These real muons are denoted as 
'fakes' in this analysis. The other, smaller correlated fake 
muon background is caused by hadrons punching through 
the calorimeter and muon steel shielding. These back- 
ground sources are more fully described in section IV C 8. 

The following sections provide a description of the 
techniques used to determine the impact parameter and 
CT shapes of the various sources and to estimate of the 
number of residual conversions, B^. — > J/tjj£X events, and 
b — > J/iplfakeX events in the sample. 

1. J/tp CT Signal and Background Distributions 



can be determined. The shape of the directly produced 
J/ip (Fdirecti^)) parameterized by a Gaussian with two 
symmetric exponential tails; this shape is the assumed 
resolution function of the cr measurement. The shape 
of J/tp events from bottom decay {F§'^{x)) is therefore 
described as a positive exponential convoluted with the 
CT resolution function. The background shape {gback) is 
fixed to the value obtained in the fit of the sideband re- 
gion and the background fraction {Jback) is fixed to the 
value predicted by the J/^ candidate mass fit. Figure 16 
shows the fit result of the signal region. The fit aver- 
age bottom proper decay length of 442 ± 5 /im is consis- 
tent with previous measurements at the Tevatron [40, 41]. 
The fit yields a bottom fraction of 16.6% ±0.2% or equiv- 
alently 22150 ± 270 J/ip from bottom decay. 



The direct J/V' and bottom decay J/tp ct shapes are 
determined from a fit to the data, using a technique pre- 
viously establish in Ref. [40, 41]. The relatively long av- 
erage lifetime of the bottom hadron (1.564±0.014 ps [42]) 
allows one to distinguish between these two sources. 
First, the signed transverse decay length L^y is deter- 
mined in the r-(^ plane. 



{Xsv - Xpv^ ■ 



(5) 



where Xgy and Xpy are the locations of the J/i/i ver- 
tex and the primary vertex in the transverse plane, and 
Pr^^^ is the vector transverse momentum of the J/tp. Di- 
rectly produced J/ip have a symmetric L^y distribution 
around zero and bottom J/tp events will predominately 
have a positive sign. If the bottom decay was fully recon- 
structed, one could determine the proper decay length 
exactly (cTproper) from the measured L^y and px- Be- 
cause the bottom hadron is not fully reconstructed, a 
'pseudo-proper decay length' (cr) is constructed using 
the kinematics of the J/t/j only: 
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FcorriPT^), determined in Ref. [40], is the average cor- 
rection factor for the partial reconstruction of the bottom 
hadron. 

The events in the J/tp mass sidebands are used to 
model the fake J/tp background under the J/tp mass sig- 
nal peak. Two components are fit using an unbinned 
log-likelihood technique: events from the primary vertex 
(direct) and events with lifetime from heavy flavor (pre- 
dominantly from b cii~X jjL'^X'). The direct events 
are described by a symmetric resolution function chosen 
to be a Gaussian plus two symmetric exponentials. The 
events with lifetime are fit with a positive only exponen- 
tial. 

Once the ct shape of the mass sideband events is found, 
the CT shapes of directly produced and bottom decay J/ tp 
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FIG. 16: Fit of J/tp signal region. 



2. Lepton Impact Parameter Signal Distributions 

The impact parameter shape of bottom decay leptons 
is determined by Monte Carlo simulation, using the pre- 
scription from Ref. [20] for parton shower Monte Carlo 
programs. In this prescription, separate samples of fla- 
vor creation, flavor excitation, and gluon splitting events 
are generated and then combined with the relative rates 
predicted by the Monte Carlo programs. In appendix B, 
the generation of the simulated events is described. 

Figure 17 shows the unsigned impact parameter for 
direct bottom electrons (6 ce^) and sequential charm 
electrons (6 ^ c — > se"*") in the flavor creation sample. 
The impact parameter distributions are very similar and 
cannot be fit for separately. The uncertainty on the rela- 
tive rate of these two sources is one of the systematic un- 
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certainties treated in section IV Dl. The bottom decay 
impact parameter shape is fit with two symmetric expo- 
nentials and a Gaussian. The fit to the simulated bottom 
impact parameter distribution for electrons is shown in 
fig. 18. 
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FIG. 17: The impact parameter of events witfi electrons pass- 
ing requirement in the flavor creation Monte Carlo sample. 
Top: Direct bottom. Bottom: Sequential charm. 



3. Lepton Impact Parameter Background Distributions 

In previous analyses [11, 12], the impact parameter 
distributions for particles originating at the primary ver- 
tex were determined using jet data. Unfortunately, any 
data sample will have low level contamination of heavy 
flavor (charm or bottom) at the « 0.1 — 1% level, which 
is larger than the non-Gaussian effects in the impact pa- 
rameter resolution. In this analysis, the impact parame- 
ter shapes of directly produced particles are determined 
with a Monte Carlo technique. 

Pythia is used to generate the light quark and gluon 
subprocesscs, which are then passed through a detector 
simulation. To be included in the electron sample, the 
candidates must be a quality track with a > 2 GeV 
and extrapolate into the electron fiducial region. For the 
muon direct sample, the candidates must have a quality 
track with a > 3 GeV and extrapolate into the CMUP 
muon fiducial region. The simulation events which pass 
the selection criteria are fit with two symmetric expo- 
nentials and a Gaussian. The fit shapes are then used as 
input to the likelihood fit for particles originating from 
the primary vertex (direct tracks). 
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FIG. 18: The signed impact parameter distribution for the 
combined electron bottom Monte Carlo. Top: Logarithmic 
scale. Bottom: Linear scale. 



4- Size of Residual Conversion Background 

One obvious source of electron background is resid- 
ual conversions left in the sample due to the inefficiency 
of finding the conversion pair. In order to estimate the 
number of residual conversions, a technique similar to 
Ref. [43] is used. It is assumed that there are two in- 
dependent causes for the lack of removal of a conversion 
electron: the track pair is lost due to tracking inefficien- 
cies at low momenta or the selection requirements arc not 
fully efficient. By measuring these two efficiencies and 
the rate of conversion removal with the chosen conver- 
sion selection requirements, one determines the residual 
electrons {Nresid)- The conversion electron that passes 
the electron identification criteria is denoted as the con- 
version candidate, and other electron that did not pass 
the electron identification criteria is denoted as the pair 
candidate. 

The number of residual electrons is equal to: 



1 



{cut) 



1 



(7) 



where Ntag is the number of the conversions removed, 



■Apt) 



^cnv {cut) is the conversion finding efficiency, and Cc 
is the tracking efficiency of the conversion pair. 

The efficiency ec„v {cut) is measured using different sets 
of conversion requirements, the tight (standard) and a 
loose set of cuts. Assuming that the loose cuts are fully 
efficient, the ratio of conversion pairs fit with tight and 
loose cuts yields ecnv{cut) = 72.3±6.5%. In order to test 
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this assumption, 2 additional wider sets of cuts are used 
which yield no extra conversion candidates. 

The tracking efficiency of the pair candidates 
i^cnviPT)) is estimated with a Monte Carlo technique 
similar to Ref. [37] . The generation of the simulated con- 
versions is detailed in Appendix B. ecnvipr) is estimated 
by comparing the pt distribution of the conversion part- 
ner in the simulation sample to the pt distribution of 
the conversion partners in the data sample. The simu- 
lation is normalized to the data in the pt range where 
the tracking is fully efficient {pr > 0.5 GeV). The ratio 
of the number of events seen in data versus the number 
of normalized conversion candidates in simulation gives 
^cnviPr) = 69±5(stat)±9(syst)%. The systematic error 
includes the uncertainty in the conversion's pT spectra. 

The ratio of the number of residual to found con- 
versions was found to be Rconv = 1.00 ± 0.38. The 
conversion veto removes 6(9) electron candidates in the 
toward(away) Acf) region. Thus, approximately 6.0 
(9.0) residual conversion are in toward(away) Acf) region. 
About 5% of the electron candidates are residual conver- 
sions and have to be included in the cr-impact parameter 
fit. A total of 9 conversions (4 toward, 5 away) are found 
in events with the J/tp candidate in the mass sideband 
region. 



5. Impact Parameter Shape of Residual Conversion 
Background 



which silicon hits are falsely added (mostly at high eon- 
version radius, Rconv > 6 cm). The relative fractions of 
the two components are determined by data. The frac- 
tion of conversions with Rconv < 6 cm [fco'nv^^'^) in 
data and simulation are matched, with the uncertainty 
in the fraction in data included as a systematic uncer- 
tainty in the shape. 

Figure 19 shows the impact parameter of the candi- 
dates found in data and the combined conversion impact 
parameter shape normalized to data. The combined im- 
pact parameter shape describes the data adequately, in- 
cluding both the negative tail and large positive tail. 




Impact Parometer (cm) 

FIG. 19: The signed impact parameter distribution for con- 
versions found in data. Solid line: Monte Carlo fit using the 
central value of f^oZf^^^- Dashed line: Monte Carlo fit in- 
creasing f2i°nv^^'^ by one sigma. Dotted line: Monte Carlo 
fit decreasing /Sn'?f '^^^ by one sigma. 



For conversion electrons from a primary photon, the 
primary vertex alway lies outside of the helix projection 
with perfect tracking. To distinguish between conver- 
sions and bottom decay electrons, the impact parameter 
is signed such that the impact parameter is positive if 
the primary vertex is outside the projection of the 
track's helix and is negative otherwise. Conversion elec- 
trons are positively signed, and Dalitz decay electrons 
and bottom decay electrons are equally negatively and 
positively signed. 

The vast majority of the conversion candidates are 
signed positive as predicted (shown in fig. 19). Unfor- 
tunately, there is a large positive tail which is not con- 
sistent with the number of silicon hits assigned to the 
track. Since at least 3 SVX' hits are required, one would 
expect the conversion candidates either to originate from 
the first two silicon layers or the beam pipe, or to be a 
tt" Dalitz decay from the primary vertex. These sources 
would produce conversions with a impact parameter less 
than 0.04 cm. Therefore, a fraction of the conversion can- 
didates must have mis-assigned silicon hits and originate 
outside of the SVX'. The measured conversion radius for 
25 of the 62 conversion candidates is greater than 6 cm, 
which is outside of the second silicon layer. 

To construct an impact parameter shape for residual 
conversions, two Monte Carlo samples are generated: a 
sample in which the silicons hits are correctly assigned 
(low conversion radius, Rconv < 6 cm) and a sample in 



6. Additional Lepton Impact Parameter Distributions in 
Events with Fake J/tp 

Events in which the J/ijj candidate is in the mass side- 
band regions are used to describe the impact parameter 
shape of leptons in events with a fake J/tp. The compo- 
sition of events in the J/ tp mass sidebands are unknown; 
therefore, the shapes have to be fit in a similar manner 
as section IV C 1. 

For the muon sample, there is no knowledge of the 
contributions to the additional muon's impact parameter 
distribution. Therefore, the shape is parameterized with 
a Gaussian and symmetric exponential. 

The impact parameter shape in the electron sample 
for events in the J/tjj mass sideband has an additional 
complication; the sample contains residual conversions. 
The number of found conversions in the signal region 
is used as a constraint on the number of residual con- 
version events in the signal region. Thus, the number 
of residual conversion events fit in the sideband region 
has to be known. In the sidebands, the residual conver- 
sion fraction is fit for fconv ~ , where rconv 

^* sideband 

and Uconvside are the fit ratio of residual to found conver- 
sions and fit number of "found" conversions, respectively. 

These quantities are constrained by the estimate of Rconv 
and the number of found conversions in the sidebands, 
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Nconvside- Sincc Tconv IS a componcnt of the signal re- 
gion fit, the signal and sideband regions have to be fit 
simultaneously. 

7. Be -» JI'>l)tX Background 

Be decay is the only known process that yields a lepton 
and B, J/ip from the same displaced vertex. CDF's mea- 
surement [37] of the j . ^^\%'+^^j/'ipK) ^-"^'^ 
time allow one to estimate both the number and impact 
parameter-cr shape of this background. The estimated 
number of Be Jf-ipiX events in the samples is used as 
a constraint in the fit. 

Taking into account correlated uncertainties, the num- 
ber of Be J/tpiX events estimated in the sample are 
N^^ = lO.Otli and N^^ = 7.2^^ According to Monte 
Carlo, over 99% of the Be passing the selection require- 
ments have A(j) < | between the lepton and the J/tp- 
Thus, this background is included in the fit only for the 
toward Ac/) region. 

Using a, Be ^ J/tpiiy Monte Carlo sample described 
in appendix B, the impact parameter-cr shape is deter- 
mined. The shape takes into account the correlation in 
the impact parameter of the addition lepton and the cr 
of the J/tp. As the ct of the J/ip increases, the impact 
parameter of the additional lepton can be larger. 

8. b —> J/'ipifake Background 

The other source of background where the impact pa- 
rameter and CT are correlated is bottom hadrons decaying 
to a real J/tp with a hadron from the same decay fak- 
ing an lepton. The sources and rates for faking Icptons 
were studied extensively in Ref. [37] and are used in this 
analysis. In Monte Carlo, more than 99% of the events 
have the J/tp and the fake lepton candidate in the same 
azimuthal hemisphere, and therefore this background is 
included only in the toward Acp region. 

The estimates of the amount and shapes of these 
backgrounds are made using a Monte Carlo sample of 
b J/tpX events, described in appendix B. Hadrons can 
"fake" an electron by showering early in the calorimeter. 
"Fake muons" can be caused by decay-in-flight of charged 
pions and kaons, and by hadrons not being completely 
absorbed in the calorimeter and leaving hits in the muon 
chamber. 

The rate that a hadron will fake an electron was stud- 
ied in Ref. [37]. The number of 6 — * J/tpefakeX events 
is determined by using these fake rates and the Monte 
Carlo sample. To be included in the fake rate calcula- 
tion, an event must pass the J/tp selection and have a 
charged hadron in the electron identification fiducial vol- 
ume with a track with SVX' hits and a > 2 GeV. 
Ideally, one would then apply the appropriate fake rate 
for the particle's pT and isolation for each track passing 
the selection, yielding the total fake rate. The isolation is 



defined to be the scalar sum of the momenta of particles 
within a cone AR < 0.2, divided by the momentum of 
the particle in consideration. Unfortunately, the Monte 
Carlo used does not include particles from the underly- 
ing event, fragmentation, gluon radiation or the other 
bottom hadron in the event. Thus, the isolation in the 
simulation does not represent the data. A large fraction 
(~ 70%) of the events have a small isolation (/ < 0.2); 
this value of the isolation is used as a central value of the 
estimate. The fake rates using the other isolation value 
estimates are used to quantify the systematic uncertainty 
of the background estimate. 

Normalizing the Monte Carlo to the estimated number 
of 6 ^ J/tpX events in data yields a estimate of 2.85 ± 
0.03(stat.) ± 0.75(syst.) events, where the second error 
is the systematic uncertainty due to modeling (or lack 
thereof) of the track isolation. 

The impact parameter-cr shape of this background is 
determined by a fit to the Monte Carlo. The Monte Carlo 
sample used in the fit consists of events which can in- 
cluded in the electron fake rate calculation. 

Decay-in-flight (DIF) of charged pions and kaons to 
muons is also a source of correlated background, as long 
as the track is reconstructible. The probability of a 
decay-in-flight to have a reconstructible track is greatly 
reduced by the SVX' requirements, as shown in Ref. [37]. 

The number of correlated background events from 
decay-in-flight is determined in a manner similar to the 
fake electron estimate. The Monte Carlo is normalized 
in the same manner as the fake electron calculation. The 
J / tp candidate is required to pass the selection criteria in 
section IV B 1, and the decay- in-flight candidates are re- 
quired to have a SVX' track with px > 3 GeV and project 
into the CMU and CMP fiducial volumes. The probabil- 
ity of decaying-in-fiight is determined for the given px 
and particle species. In Monte Carlo, 64.2 ± 0.3% of the 
particles passing the requirements are kaons. The decay- 
in-fiight background is estimated to average 9.9 ± 2.1 
events. The error includes the 12% Monte Carlo calcu- 
lation systematic uncertainty and a 17% reconstruction 
efficiency systematic uncertainty quoted in Ref. [37]. 

In Ref. [37], the decay-in-flight estimate was done us- 
ing data. In that analysis, the kaon fraction was mea- 
sured to be (44±4.4)%. The difference between the kaon 
fraction in Ref. [37] and the simulation could lead to a 
large systematic difference, because of the difference in 
the kaon and pion decay-in-flight probabilities. To es- 
timate this uncertainty, the Monte Carlo events are re- 
weighted in order to match the kaon fraction measured 
by [37]. With the re- weighting, the estimated number of 
correlated decay-in-flight backgroimd is 8.7±2.0. The dif- 
ference between the two estimates is conservatively used 
as the systematic error, yielding a final decay-in-flight 
estimate of N'/,'^'^ = 9.9 ± 2.4. 

The impact parameter-cr shape of the decay-in-flight 
background is determined by the same Monte Carlo sam- 
ple. In Ref. [12, 37], it is shown that the impact parame- 
ter distribution of reconstructible decay-in-flight particles 
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with SVX' information have the same impact parameter 
distribution as the parent particle. Similar to the fake 
electron shape, the Monte Carlo events which could be 
used in the DIF rate calculation are fit in order to deter- 
mine the DIF impact parameter-cr shape. 

Hadrons can also mimic muons by not being com- 
pletely absorbed by the calorimeter and leaving hits the 
muon chambers. The probability of a track punching- 
through the calorimeter was determined in Ref. [37] . The 
selection criteria of the punch-through estimate is the 
same as the the decay-in-flight estimate. The punch- 
though probability of the tracks passing the requirements 
is calculated from its particle type and momentum, yield- 
ing the final estimate. An average 1.76 ± 0.70 punch 
though events are expected in the data, including a 40% 
systematic error used in [37]. 

As the punch-though rate is much larger for if + than 
or TT^, the large difference in kaon fraction be- 
tween [37] and Monte Carlo (shown in the decay-in-flight 
estimate) is a significant systematic shift in the punch- 
through estimate. To be conservative, we re-weight 
the data with the kaon fraction measured in Ref. [37]; 
1.23 ± 0.46 events are expected. The diff'erence between 
the two predictions is used as the estimate of the sys- 
tematic uncertainty in the prediction, yielding a final es- 
timate of the average number of correlated backgrounds 
from punch-through of A^^j^ ^ = 1.76 ± 0.88 events. In 
Ref. [44], the decay- in-flight and punch-through back- 
grounds are shown to have the same lifetime shape 
in the Be lifetime fit. The decay-in-fiight and punch- 
through backgrounds are assumed to have the same im- 
pact parameter-cr shape. 



D. Unbinned Likelihood Fit Results 

An unbinned log-likelihood fit is used to determine the 

estimated number of bb pairs and backgrounds in the two 
regions. Inputs to the fit are the J/^/i's cr, the addi- 
tional lepton's impact parameter, and the shapes and the 
background estimates described in the previous sections. 
The shapes are used to determine the sample composi- 
tions, with the background estimates used as constants. 
In appendix C, the complete details of the log-likelihood 
function is given. 

The log-likelihood (—2 ln£) is minimized for both data 
sets using MINUIT [45] . The fit parameter errors are de- 
fined by ±lcr (A£ = 1) contours of the likelihood func- 
tion using the MINOS option. The results of the fit are 
shown in table VI. In order to display the fit result, the 
log-likelihood function has been integrated in regions of 
impact parameter-cT space. As examples of the fit qual- 
ity, figures 20 and 21 show the fit results projected onto 
the impact parameter and cr axis for the electron sample 
in the away A(/) region and muon sample in the toward 
A</) regions, respectively. 
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FIG. 20: Result of the cr-impact parameter fit for the elec- 
tron sample in the away bin. Top: Projection onto impact 
parameter axis. Bottom: Projection onto ct axis. 
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FIG. 21: Result of the cr-impact parameter fit for the muon 
sample in the toward bin. Top: Projection onto impact pa- 
rameter axis. Bottom: Projection onto cr aacis. 



The toward fraction measured in the two samples are: 

J toward - 

ftoward = 19-2^5.9% 

The measurement error includes both the statistical error 



(8) 
(9) 
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as well as systematic imeertainties due to the eonstraints. 

As a test of the fitting technique, a set of 1000 toy 
Monte Carlo 'experiments' have been generated. The 
results of the study, given in appendix D, show that the 
fit results are unbiased and have proper errors. 



1. Fit Systematics 

By measuring the fraction of bottom quark pairs pro- 
duced in the same hemisphere /toward, the systematic 
uncertainties are minimized. The selection in both the 
A(p < 7r/2 and Acj) > ■k/2 regions are the same; there- 
fore, the uncertainties in the lepton selection efficiency, 
tracking efficiency, luminosity, etc. mostly cancel in the 
fraction measurement. In this section, the systematic un- 
certainty in the log-likelihood not already included in the 
fit is estimated. The estimated size of these systematic 
uncertainties arc collected in table VII. 

The sequential charm fraction (fseq) that is used in the 
bottom impact parameter shape {F^°) is derived from the 
simulation. The uncertainty in the sequential charm frac- 
tion leads to a systematic uncertainty in the determina- 
tion of , as sequential charm leptons have a larger im- 
pact parameter than direct bottom leptons. In Rcf. [12], 
the relative systematic uncertainty in fseq was studied. 
The relative uncertainty in fseq was ±19%, which is used 
in this analysis. The value of fseq is varied by ±1(7, F^^ " 
is re-fit, and then the new Fl^° shapes are used to re-fit 
f toward- The maximum differences of ±0.1% and ±0.3% 
are assigned as the systematic uncertainty for the elec- 
tron and muon samples, respectively. 

The bottom hadrons' lifetimes B^, Bs, and Ab) 
and their decay products' impact parameters are strongly 
correlated. In order to estimate the uncertainty caused 
by bottom lifetime uncertainty, two additional Monte 
Carlo samples were generated using BGENERATOR, a 
fast bb Monte Carlo that approximates the NLO predic- 
tion by Ref. [1]. All the bottom hadron lifetimes are 
shifted by ±la from their PDG values [42]. The F^'' 
shapes determined by these samples are then used in a 
re- fit of f toward- Thc estimate of the systematic uncer- 
tainty due to the bottom lifetime is chosen to be the 
greatest differences from the standard fit. The uncer- 
tainties estimated in the electron and muon samples are 
±0.3% and ±2.2%, respectively. 

Bg, B~^, and B~ have proper decay lengths of ~ 
470 /Ltm, whereas Ab has a proper decay length of 387 /xm. 
Thus, the uncertainty in the fraction of bottom quarks 
fragmenting to A^ leads to thc largest uncertainty of the 
F^^ shape. Using BGENERATOR, samples are gener- 
ated with the Af, fragmentation fraction varied by ±1(T 
from the PDG values [46] . The new Fl^'^ shapes are 
used to re-fit f toward, with the maximum difference from 
the standard fit used as the estimate of the systematic 
uncertainty, yielding a systematic imcertainty of ±0.1% 
and ±0.2% for the electron and muon samples. 



Due to the limited number of residual conversions in 
the sample, the number of conversions pairing with J/^ 
for bottom decay {ntconv) and with direct J/tp (jidconv) 
cannot be fit independently. Thus, the ratio between 
nbconv and Udconv is fixed to the fit ratio between J/tp 
from bottom decay and directly produced J/V'- In order 
to estimate the systematic uncertainty due to this as- 
sumption, the fit of the data is re-done with either ribconv 
or Udconv fixed to zero; the difference between fits are 
used as an estimate of the systematic uncertainty, yield- 
ing a systematic uncertainty of ±0.1% for the electron 
sample. 

The residual conversion shape (Fconv) is determined 
using data and simulation. In data, the conversion radii 
of the found conversions indicate that a large fraction 
of the conversion candidates have at least 1 SVX' hit 
mis-assigned to thc track. The shape Fconv is the sum 
of two shapes: F^^'^^^^-^ , which describes the shape of 
residual conversion where SVX' hits are assumed to be 
correctly assigned, and F^^^^^^ , which describes thc 
shape of residual conversion where at least 1 SVX' hit 
is assumed to be incorrectly assigned. F^^'^^^^^ and 
^cMnv^^^ are determined using Monte Carlo described 
in section IV C 5. The value of f^on°v^^^ is changed by 
±1 (7 in Fconv i^ order to estimate the systematic uncer- 
tainty due to the Fconv shape used. The maximum dif- 
ference of ±0.2% is assigned as a conservative estimate of 
the systematic uncertainty due to the residual conversion 
impact parameter shape used. 

The direct impact parameter shapes {F^°^^^^) are deter- 
mined by a fit to Monte Carlo samples in section IV C 3. 
The finite size of the Monte Carlo samples lead to an un- 
certainty in the fit parameters of the shapes. In order to 
estimate the uncertainty in f toward due the F^"^^^^ shape 
uncertainty, each parameter is fixed to a value ±1ct from 
the best fit value and the F^f^^^^ shape is re-fit. The 
new shape is then used in the impact parameter cr fit. 
The largest negative and positive differences from the 
standard fit is conservatively assigned as the systematic 
error, lo'4% for the electron sample and l^lo^ for the 
muon sample. 

The direct and bottom cr shapes are determined by a 
fit to the data. In the fit, the fraction of fake J /ip events 
f^jback\^ is fixed at the predicted fraction. In order to 
estimate the effect on f toward, the value of f^°-'^^ is change 
by ±lfT and the J/tp mass signal region cr fit is re-done. 
The resulting F^J^^^^ and F^'^ shapes are used in a re- 
fit of ftoward- The greatest difference from thc standard 
fit is chosen to be a conservative estimate of systematic 
uncertainty, yielding uncertainty estimates of ±0.015% 
and ±0.01% for the electron and muon channels. 

In this analysis, the number of events with a directly 
produced J/tp with a lepton from bottom decay {udb) 
is assumed to be zero. In order to measure thc effects 
of this assumption, a fit of ftoward is performed where 
Udb is a free parameter. The difference in this fit from 
the standard fit is assigned as the systematic uncertainty 
due to Udb- We assign a ±1.9% uncertainty to f^^^^rd 
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Fit parameter 


Electron 


Electron Constraint 


Muon 


Muon Constraint 


OO 






23.0l^J 




bcoThv 


0.6(fixed) 




N/A 




^db 

ndd 


(fixed) 
37.0+?-^ 




(fixed) 




dconv 
''^lide 


2.8+?i 


45 


N/A 
32.9t^:^ 


34 




2 O+0.7 


2.85 ±0.75 


10.7t|| 


11.7 ±2.6 










77* 

signal 


107.1 


107 


68.2 


64 


' ''conv 


5.6 


6 


N/A 




Kb 

00 

^bd 


124.7ti^;? 




43.6tiV 

8.ii«:l^ 




bconv 


1.2(fixed) 




N/A 




Kb 

Kd 


(fixed) 
49.5t^-J 




(fixed) 
16.0t^:| 




dconv 






N/A 




"'Lie 




47 




17 


signal 
^a 

' "conv 


204.9 
9.5 


205 
9 


76.8 

N/A 


78 






0.501 ±0.044 


U.OUi_Q 043 


0.501 ± 0.044 


'^conv 


QQ+0-31 
U.»»_Q 2g 


1.00 ±0.37 


N/A 





TABLE VI: Fit results and constraints for the electron and muon samples. Appendix C describes all the fit parameters in 
detail. Variables Usignai and riconv are not fit parameters but are functions of fit parameters. 



Source 
Sequential Rate 
B Lifetime 
Fragmentation Fractions 



nbc 



c 



ratio 



Residual Conversion Shape 
Direct Impact Parameter Shape 
fback (for FBi.ect and F,-) 

Ndb 



Electron 
±0.1% 
±0.3% 
±0.1% 
±0.1% 
±0.2% 

-0.4/0 

±0.02% 

±0.1% 



Total 



+0.5qy 
-0.6 /O 



Muon 
±0.3% 
±2.2% 
±0.2% 



+7.4(v 
-1.0/0 

±0.01% 

±1.9% 



-3.1/0 



2. Correction to h quark level 

At this time, no fragmenting NLO QCD calculation 
of bottom production at the Tevatron exists. In or- 
der to compare to next- to-leading order calculations, one 
must 'correct' the experimental measurement to the bot- 
tom quark level {f toward)' using similar technique as in 
Ref. [11, 12, 47]. 

The correction is: 



C. 



B^b 



fbb 

J toward,mc 

fb^J/tpxib^tY 
J toward.mc 



(10) 



TABLE VII: Summary of the estimated values of the system- 
atic uncertainty for f toward- 



and ±0.1% to f^^ward- 

The individual systematic uncertainties arc added in 
quadrature in order to determine the combined system- 
atic uncertainty. The systematic uncertainties for the 
electron and muon samples are and ^3.1%, respec- 

tively. 



.b^J/ipX;b- 
•f toward,mc 



■tY 



is the ftoward prediction by Pythia, 
where the is calculated between the J/ip and the 
additional lepton, which both meet the selection criteria. 

The quantity /toward mc fraction of bottom quarks 

produced by Pythia in the same hemisphere which pass 
the following criteria: 



> p^/''' and I < y 
and \y''^\< y 



J/M> 



28 



Sample 


Pt 


y 


Pt 


y 


FC 


6.8 GeV 


0.66 


5.3 GeV 


0.98 


FE 


7.1 GeV 


0.66 


3.8 GeV 


1.06 


GS 


6.4 GeV 


0.70 


3.8 GoV 


0.92 


Ave 


6.8 GeV 


0.67 


4.3 GeV 


0.99 




Sample 


py-" 






y'' 


FC 


7.3 GoV 


0.66 


6.6 GoV 


0.60 


FE 


7.0 GeV 


0.66 


5.8 GeV 


0.66 


CiS 


(i.G Go\' 


0.68 


5.7 Go\' 


0.58 


Ave 


7.0 GeV 


0.67 


6.0 GeV 


0.61 



TABLE VIII: 90% acceptance requirements on the bottom 
quarks decaying to a J/ or a lepton predicting by Pythia 
Monte Carlo and a detector simulation. Top: Electron. Bot- 
tom: Muon. 



6i.2 can be cither the bottom quark or antiquark. No 
requirements are made on the decay products of the bot- 
tom quarks. 

The distributions of the pT and rapidity [y) of bottom 
quarks in Monte Carlo events that pass J/ij) and lepton 
selection (shown in figure 22-23) are used to determine 
the Pt and rapidity regions for the calculation of the 

correction factor. p'^J^ is defined to be the value of the 
bottom quark transverse momentum in which 90% of the 
b quarks which decay to a J/tp have a higher momenta. 
yJ/i) is defined to be the value of bottom quark's \y\ in 
which 90% of the b quarks which decay to a J/tjj have a 
lower \y\. p^ and y^ are defined in a similar manner for 
the bottom quark that decayed into the additional lep- 
ton. Table VIII shows the value determined in both the 
electron and muon samples for three different production 
mechanisms. The rapidities of all three mechanisms are 
very similar and arc determined by the detector geome- 
try. The Pt values are different in the three mechanisms. 
Flavor creation produces two bottom quarks with simi- 
lar momenta, while gluon splitting and flavor excitation 
produce quarks with dissimilar pT- The values of y^^^, 

y^i Pt^ 1 ^'^^ Pt used in the correction factor calcula- 
tion is the average of the three production mechanisms. 
To estimate the size of the systematic uncertainty, Cs^b 
is calculated for each production mechanism separately. 
The systematic uncertainty is estimated as the largest 
difference between the individual and combined produc- 
tion mechanisms. 

Tabic IX shows the ftl^ard.mc^ ./tt^ard, mc ' ^"^^ 

Cs^b for the complete sample and the three separate 
production mechanisms. The values estimated are: 



,6 = 0.967 ±0.019(stat.)±0.088(syst.) (11) 
= 0.968 ± 0.026(stat.) ±0.061(syst.) (12) 



The measured toward fraction for the bottom quarks 
if toward) extracted using the correction factor (Cs^b) is: 
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FIG. 22: Pt of the bottom quarks in events that pass selection 
in the electron Pythia samples. The arrows indicate the 90% 
acceptance value. 
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til ± 2.3% 



(13) 
(14) 



where the first error is the fit error, the second error is the 
additional shape systematic uncertainties on f toward, and 
the third error is the uncertainty due to the correction to 
the bottom quark kinematics. 



E. Data— Theory Comparisons 

The measured toward fraction corrected to the quark 
level is compared to the NLO QCD predictions [1], 
using the same requirements as for the correction of 
the experimental mcastirements. The NLO prediction 
iftoward) made using nib — 4.75 GcV, a rcnormal- 



ization/factorization scale /x = y ml + {p^ -|-p^)/2 and 
CTEQ5M [48] and MRST99 [49] parton distribution 
functions (PDFs). To estimate the systematic uncer- 
tainty in the NLO calculation, rub is varied from 4.5-5.0 
GcV, and /i is varied from 0.5 2.0. To study the effects 
of large initial state parton transverse momenta (/ct), 
the NLO prediction is also made with (/ct) values of 0-4 
GcV. The kT effects are implemented in the same man- 
ner as in Ref. [50], where a (/ct) of 3-4 GeV per parton 
is predicted at the Tevatron. f^g!^2rd predicted using 
MRST99 and CTEQ5M PDFs, respectively, for the dif- 
ferent input values of (fcr), rnb, and ix. Table X shows 
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FC FE GS Combined 

ftowIrd,rnc 5.1 ± 0.1% 21.4 ± 0.5% 46.4 ± 0.5% 26.4 ±0.2% 

/ward.mc^^^ 5-8 ± 0.4% 23.4 ± 0.8% 47.8 ± 0.8% 27.3 ± 0.5% 

CB^b 0.879 ±0.063 0.915 ± 0.038 0.971 ± 0.020 0.967 ± 0.019 



FC FE GS Combined 

ftoZrd.rnc 3.5 ± 0.2% 19.5 ± 0.8% 47.2 ± 0.8% 25.5 ± 0.4% 

/wartmc^^^ 3-4 ± 0.5% 20.4 ± 1.3% 49.3 ± 1.2% 26.3 ± 0.7% 

CB^b 1.029 ±0.164 0.956 ±0.072 0.957 ± 0.028 0.968 ± 0.026 



TABLE IX: Correction factor between the experimental measurement and the bottom quarks. The errors quoted are statistical 
only. Top: Electron sample. Bottom: Muon sample. 
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FIG. 23: \y\ of the bottom quarks in events that pass selection 
in the electron Pythia samples. The arrows indicate the 90% 
acceptance value. 



the summary of the predictions. The NLO predictions 
do not depend strongly on the PDF selected. The mea- 
sured /jortuard ^"-"^ ttoth thc electron and muon sample are 
consistent with thc NLO prediction for values of {kx) be- 
tween zero and 3 GeV. The renormalization/factorization 
scale uncertainties in the NLO predictions and the statis- 
tical uncertainty in the measurement of f toward prohibit 
a more precise determination of (/st) from this analysis. 

Figure 24 illustrates the effects of varying the PDF, 
(fcr), and renormalization/factorization scale on the 
NLO predictions in thc electron acceptance region. Vary- 
ing mass does not affect the predicted shape, but 
instead only affects the total cross section predicted. 
The two different PDFs studied yield very similar shape 
and total cross section predictions. Only scale and 



NLO MRST99(Electron) 



{kr) = 0.0 GeV 
(fcr) = 1.0 GeV 
(kr) = 2.0 GeV 
{kr) = 3.0 GeV 
(fcr) = 4.0 GeV 



16.9% ± 0.2% (stat. 

19.4% ± 0.3% (stat. 
23.2% ± 0.4% (stat. 
31.9% ±0.6% (stat. 
44.9% ± 0.7% (stat. 



NLO CTEQ5M(Electron) 



{kr) = 0.0 GeV 
(kr) = 1.0 GeV 
{kr) = 2.0 GeV 
{kr) = 3.0 GeV 
{kr) = 4.0 GeV 



16.5% ± 0.2% (stat. 
19.1% ±0.3% (stat. 
23.1% ±0.4% (stat. 
31.7% ±0.6% (stat. 
45.1% ± 0.7% (stat. 
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PYTHIA (Electron) 



26.4% ± 0.2% (stat.) 



Data(Electron) 



18.6t^;f t°oi ± 1.7% 



NLO MRST99(Muon) 



(kr) = 0.0 GeV 
{kr) = 1.0 GeV 
(kr) = 2.0 GeV 
{kr) = 3.0 GeV 
{kr) = 4.0 GeV 



16.7% ± 0.3% (stat. 
22.7% ± 0.6% (stat. 
24.7% ± 0.6% (stat. 
32.1% ± 0.6% (stat. 
44.9% ± 0.9% (stat. 



NLO CTEQ5M(Muon) 



{kr) = 0.0 GeV 
(kr) = 1.0 GeV 
(kr) = 2.0 GeV 
(kr) = 3.0 GeV 
(kr) = 4.0 GcV 



16.8% ± 0.3% (stat. 
22.1% ± 0.5% (stat. 
23.8% ± 0.9% (stat. 
31.9% ±0.9% (stat. 
44.4% ± 1.2% (stat. 



+5,2% 
-3,6% 
+6,1% 
-7,0% 
+6,7% 
-6,5% 
+6,4% 
-6,9% 
+5.1% 
-6.6% 



(sys.) 
(sys.) 
(sys.) 
(sys.) 
(sys.) 



+4,1% 
-3,8% 
+5,6% 
-5,9% 
+ 7,4% 
-5,0% 
+5,7% 
-6,5% 
+ 7,1% 
-5,0% 



(sys.) 
(sys.) 
(sys.) 
(sys.) 
(sys.) 



PYTHIA (Muon) 



25.5% ± 0.4% (stat.) 



Data(Muon) 



33.4+°;^ +^;^ ± 2.3% 



TABLE X: Compilation of the corrected data results, the 
PYTHIA predictions, and the NLO predictions of ftoward 
Top: Electron. Bottom: Muon. 
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(fcr) variations yield appreciably different shape and to- 
tal cross section predictions. Varying the rcnormaliza- 
tion/factorization scale changes the total cross section as 
expected; lowering the scale increases the total cross sec- 
tion. In addition, varying the scale changes the predicted 
rate at large At/)'''' (> 2.9 radians) relative to the rest of 
the distribution, while the shape of < 2.9 radian region 
varies little. Varying the scale changes the relative rates 
of pp bb to pp — > bbg in the NLO prediction. Varying 
the (kr) on the other hand, changes the predicted Acf)^'' 
in a more continuous manner. With the increased num- 
ber of J/ip+£ expected in Run II, a differential azimuthal 
cross section measurement with 6-12 bins in should 
be able to separate scale uncertainty and kr smearing 
effects. 



V. CONCLUSIONS 

We have presented two new measurements of the 

A(j) distribution for bottom anti-bottom pairs produced 
through QCD interactions at the Tevatron. These mea- 
surements are specifically targeted to measure the A(j) 
distribution down to arbitrarily small opening angles 
for bb pairs produced with low transverse momentum, 
where previous measurements have lacked sensitivity. 
The small bb opening angle region is of interest because in 
this region, the higher-order bb production mechanisms 
dominate over flavor creation. The data presented here 
are consistent with other measurements and cannot be 
described solely by flavor creation. Both measurements 
indicate that a significant fraction of the bb pairs (roughly 
25%) are produced with A0 < 90°, in agreement with the 
conclusion from previous analyses [11, 20] that flavor ex- 
citation and gluon splitting play a signiflcant role in bb 
production at the Tevatron. The results of these mea- 
surements are consistent with the parton shower Monte 
Carlo models of Pythia and Herwig as implemented 
in Ref. [20] and with NLO QCD predictions. Neither 
non-perturbitive [14] nor supcrsymmetric [15] production 
mechanisms are needed in order to describe the measured 
A0 spectra. 
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Sample Name 


ISR Setting 


High 


PARP(67) = 4.0 


Medium 


PARP(67) = 3.0 


Low 


PARP(67) = 1.0 



TABLE XI: The naming convention for the three ISR samples 
for Pythia. 
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APPENDIX A: MONTE CARLO PARAMETERS 
FOR THE SECONDARY VERTEX TAG 
CORRELATION ANALYSIS 

The specific Monte Carlo generator settings used for 
the secondary vertex tag correlation analysis are specified 
below. For a more explicit discussion of the Monte Carlo, 
consult Ref. [31]. 



1. Pythia 

Version 6.203 of Pythia [17, 18] was used for this 

analysis. Flavor creation events are generated with the 
process MSEL = 5, while flavor excitation and gluon 
sphtting use MSEL = 1. The parameter PARP(67), 
which holds the value that is multiplied by the of 
the hard scatter to determine the maximum virtuality of 
the initial-state shower, was used to control the amount 
of initial state radiation in the Pythia samples. Three 
different values of PARP(67) were used and for each set- 
ting, other Pythia parameters were manipulated to give 
the best match to the CDF data. The naming conven- 
tion for the three ISR samples is given in Table XI. This 
tuning was done with the CTEQ5L [51] parton distribu- 
tion functions and different Pythia parameters may be 
required to achieve the same tuning with a different set of 
parton distribution functions. Table XII gives the values 
of the parameters used for this tuning. Parameters not 
mentioned in Table XII were left at the default values for 
this version of Pythia. 



2. Herwig 

Version 6.400 of Herwig [19] was used for this analysis. 

The flavor creation and flavor excitation samples were 
generated with IPROC = 1705, and the gluon sphtting 
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Electron Fid (pt'>6.8, lyr<0.67 p/>4.3 lyl'<0.99) 

1 
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• MRST99 m,= 4,75 GeV/i=1 kT=0 GeV 
■ MRST99 m,=4,75 GeVM=1 kT=1 GeV 
A MRST99 mt=4,75 GeV Ai=1 k,= 2 GeV 
▼ MRST99 m^=4.75 GeV^=1 GeV 

* MRST99 m„=4,75 GeV ^=1 K=4 GeV 
— Total Pythia 5.6 CTEQ3L 

- -FC Pythia 5,6 CTEQ3L 

- FE Pythia 5.6 CTEQ3L 
GS Pythia 5.6 CTEQ3L 
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FIG. 24: NLO prediction [1] of bottom spectra in the acceptance region. The PYTHIA prediction is shown as a reference. 
Top Left: MRST99 PDF varying the additional kr smearing. Top Right: CTEQ5M PDF varying the additional kr smearing. 
Bottom Left: ]VIRST99 PDF varying the renormalization scale /j. Bottom Right: 1VIRST99 PDF varying the bottom quark 
mass ms- 



sample was made using IPROC = 1500. Only one Her- left at their default values for this version. 
WIG sample was generated for each production mecha- 
nism, again using the CTEQ5L parton distribution func- 
tions. Aqcd was set to 192 McV and the CLPOW pa- 
rameter was set to 1.25 to match the observed frequency 
of B baryons at CDF. All other Herwig parameters were 



32 



Parameter 


Meaning 


PARP(67)=3. 0,4.0 1 PARP(67)=1.0 


MSTP(81) 


Multiple-parton interaction switch 


1 (Multiple Parton Interactions ON) 


MSTP(82) 


Model of multiple parton interactions 


3 (Varying impact parameter assuming 
a single Gaussian matter distribution) 


PARP(82) 


Pt turn-off when using single Gaussian model of multiple 
interactions 


1.7 1.6 


PARP(85) 


Probability that a multiple parton interaction produces two glu- 
ons with color connections to the "nearest neighbors" 


1.0 


PARP(86) 


Probability that an MPI produces two gluons either as described 
above or as a closed gluon loop. The rest of the MPIs produce 

quark-antiquark pairs 


1.0 


PARP(89) 


Determines the reference energy EO 


1800. 



TABLE XII: The table above shows the Pytiiia setting used to tune the underlying event to data for the CTEQ5L parton 
distribution set and three different initial-state radiation settings. 



APPENDIX B: J/V'-LEPTON CORRELATION 
MONTE CARLO EVENT SAMPLES 

Various simulated data sets have been necessary for the 
measurement of f toward in the J/^/i-lepton data sample. 
The follow section will detail the method of generating 
the simulated events used in the measurement. 

For the bottom impact parameter description, 
Pythia [17, 18] with the CTEQ3L [48] parton distribu- 
tion functions is used. The bottom quarks are hadronized 
using the Bowler fragmentation function [52] and us- 
ing the LUND string fragmentation model. The result- 
ing bottom hadrons are decayed using the CLEO decay 
model [35]. The events arc then passed through a de- 
tector simulation [53] and the trigger simulation. The 
same selection criteria is applied to the J/xj} candidates 
in Monte Carlo as in data. 

For the bottom decay impact parameter shape for 
muons, the muons are required to be fiducial in both 
the CMU and the CMP muon subsystems with a SVX' 
track and a > 3 GeV. 

For the bottom decay shapes for electrons, the elec- 
trons are required to be in the CEM fiducial region with 
a SVX' quality track with a f>T > 2 GeV. The efhciency 
of the electron identification criteria is simulated in the 
same manner as Ref. [37]. The CPR, the CES, and the 
CTC dE/dx selection criteria do not depend on the iso- 
lation of the electron, due to the fine segmentation of the 
CPR, the CES and the CTC. Therefore, the efficiencies 
as a function of px of the CPR and the CES selection 
derived by Ref. [54] using conversions can be used. The 
CTC dE/dx efficiency as a function of p is defined by 
the selection criteria. The rate of signal events being 
removed as conversions and the Ehad/ Eem and E/p effi- 
ciencies depend on the isolation of the track. Therefore, 
the values simulated in the Monte Carlo have to be used 
to determine the efficiency. 

For residual conversion studies and impact parameter 
shape determination, the necessary simulated sample was 

generated in the following manner. A sample of tt^ is sim- 
ulated in the detector. A soft electron conversion candi- 



date is required to have a found track with SVX' informa- 
tion in the electron fiducial region with a px > "2 GeV. 
The efficiency of electron identification requirements is 
simulated in the same manner as for the bottom impact 
parameter shape sample. 

The simulated 7r° are generated with a power law spec- 
tra for pt and a flat r] distribution. The order of the 

power law is varied in order to match the px spectra of 
the found SLT conversion candidates in data. The shape 
of the found pair candidates' px is used as a cross check of 
the power law description of the conversions. Figure 25 
shows the Monte Carlo spectra normalized to the data 
for a power law of 3, 3.5, 4, and 5. The 3.5 order power 
law describes the data well and is used for the calculation 
of the efficiency. The 3rd and 4th order power law is used 
as a estimate of the systematic uncertainty. 

The px of the pair candidates is shown in figure 26. 
The 3.5 order power law describes the shape of events 
with Px > 0.5 GeV (where the tracking is assumed to be 
fully efficient). Half the difference between the 3rd and 
4th order power law spectra is used as the systematic 
uncertainty of the estimate. 

The impact parameter-cr shape of the Be background 
was determined using a Monte Carlo sample. The Be 
mesons are generated according to the NLO fragmenta- 
tion model from Ref. [55] with a fiat rapidity spectra. 
The particles are decayed using the semi-leptonic decay 
model of Ref. [56] and passed through a detector and trig- 
ger simulation. The selection criteria used is identical to 
section IV C 2. 

Single bottom quarks are generated according to the 
next-to-leading order QCD predictions by Ref. [57] and 
fragmented using the Peterson fragmentation model [58] . 
The resulting bottom hadrons are decayed using the 
CLEO decay model [35], requiring a J/ip — > de- 
cay. The events are then passed through a detector sim- 
ulation [53] and the trigger simulation. For both the 
calculation of electron and muon fake rates, the J/tp is 
required to pass the same selection criteria as data. The 
sample is normalized to the number of J/tp events from 
bottom decay fit in data in section IV C 1. 
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FIG. 25: The pr spectra of the SLT conversion candidates. 
Top Left: 3rd order power law. Top Right: 3.5 order power 
law. Bottom Left: 4th order power law. Bottom Right: 5th 
order power law. 



APPENDIX C: J/V- LEPTON CORRELATION 
LOG-LIKELIHOOD FUNCTION 

The fit parameters will be in lower case, the constraints 
will be upper case, and the errors on the constraints (if 
applicable) are denoted A(Constraint). The superscripts 
indicate the additional Icpton used (e for electrons, /i for 
muons) and the A<p region (t for toward, a for away). For 
example, N^* is the number of Be J/tpeX background 
events estimated in the electron sample in the toward Acj) 
region. 



1. Data 

The inputs to the fit on an event-by-event basis are 
the J/tp candidate's cr and the additional lepton candi- 
date's impact parameter. In the following sections, x will 
denote the impact parameter, and y will denote the cr. 
The numbers of candidates in the J/V' mass sideband and 
signal regions in both regions are used as a constraint 
in the likelihood. In the electron sample, the numbers of 
found conversions in J/tjj mass sideband and signal re- 
gions in both A0 regions also used as a constraint. Con- 
version constraints are discussed later in their respective 
sections. 
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FIG. 26: The pr spectra of the pair candidates. The Monte 
Carlo is normalized to the data with pr > 0.5 GeV. Top 
Left: 3rd order power law. Top Right: 3.5 order power law. 
Bottom Left: 4th order power law. Bottom Right: 5th order 
power law. 



P(j^signal ? ^signal ) 



(j^signal ) 



N, 



signal 



(Cl) 



with the appropriate Usignal and Ngignai for the given 
sample and region. Usignai is not a fit parameter, 
but is a function of the other fit parameters, shown in 
section C 8. 



2. J/ipX- b IX' signal 

The shapes which are used for the h J/tpX;!) iX' 
signal are described by the fit functions in sections IV C 1 
and IV C 2. The impact parameter and cr are assumed 
to be uncorrelated. Therefore, the shape which describes 
the signal is the product of the impact parameter shape 
{F^°{x)) and the cr shape {F^'^{y)) for bottom decay. 
The parameters that are used in F^°{x) are different for 
the electron and muon fits. 

The number of bb events fit is n^j^ with the superscripts 
given by sample and A(p region. For example, the num- 
ber of bb events fit in the toward A(f> region in the electron 
sample is nfi*. The bb contribution of the shape compo- 

bb 



nent of the likelihood is given by — -^^^ • (x) • 



per 



(y) 



with the appropriate superscripts for the additional lep- 
ton type and A0 region. 



34 



3. Unconstrained, uncorrelated backgrounds 

The impact parameter-cr shapes of the three sources 
of uncorrelated backgrounds without constraints, consid- 
ered in this analysis, are constructed using the functions 
derived in sections IV C 1 and IV C 3. The fit parameters 
for these three backgrounds are: 

• riM'- the number of events with the J candidate 
and with the additional lepton candidate both di- 
rectly produced 

• ribd'- the number of events with the J/V' candidate 
from bottom decay and with the additional lepton 

candidate produced directly 

• Udb'- the number of events with a directly produced 
J/ip candidate and with an additional lepton can- 
didate from bottom decay 

where the superscripts indicate of the sample and 
region. The number of events with a directly produced 
J /ip candidate and an additional lepton candidate from 
bottom decay is assumed to be small and ridb is fixed to 
zero. This parameter is released and fit for as an estimate 
of systematic uncertainty due to this assumption. 

The shape component of the likelihood for these three 
backgrounds is assembled in the same manner as the bb 
signal. 

4. Residual conversion background 



the sideband shape component is rsidenl^^pf^^^^, where 
f£o = roo,^^,■nao„^side jg ^jjj, fractlou of J / ilj mass side- 

J conv Jy sideband . • i i 

band events where the electron is a residual conversion. 

Due to the relatively small number of residual con- 
versions, fitting all three pairing of J ftp candidate types 
with conversions is not possible. In order to constrain 
this component of the fit farther, the ratio of between 
ntconv and ndconv is assumed to be the same as the ra- 
tio between J/ip mesons from bottom decay and J/ijj 
mesons produced directly (at the primary vertex). The 
fraction of J/tjj mesons from bottom decay is fit to be 
16.6% ± 0.2% in section IV CI, yielding the relation- 
ship njj^^^ = 0.2 • nJconv an estimate of the sys- 
tematic uncertainty, njj^^^ and nj^onv fixed to zero 
in separate fits in order to probe the full range of ratio 

t/a t/a 
bconv ' dconv' 

The number of found conversions in the two A0 re- 
gions is used as a constraint on the fit of the residual 
conversions. The number of found conversions fit is the 
number of residual conversions fit divided by the ratio of 
residual versus found conversions: 

. , 'f^side'^sidc^convside fC^o'\ 

f^conv = '^hconv ~r '^dconv H 77 v^*^/ 

sideband 

The constraint using the number of found conversions is 

the Poisson probability of finding Nconv conversion can- 
didates with a mean value of number of found conversion 
fit. 

P(n N ) - ("g""")^"""" ^-(n..nn.„) (nA) 
yH'ConVT conv ) — /VI V^^J 
conv • 



The total number of predicted residual conversions is 
Rconv ■ ^conv, where Rconv is the ratio between the num- 
ber of residual versus found conversions and Nconv is the 
number of found conversions in the sample. The num- 
ber of found conversions removed from the two A(j) re- 
gions with the J/tp candidate in the mass signal region is 
N^onv = 6 and A^*o„^, = 9, respectively. In section IV C 4, 
Rconv is estimated to be 1.00±0.37, using data and Monte 
Carlo techniques. Residual conversions are assumed to 
pair with all three sources of uncorrelated J/iIj candi- 
dates: fake J/ip , directly produced J/ip, and bottom 
decay J/ip. The same value of the fit parameter rconv is 
used for all sources of J/ tp candidates that pair with the 
residual conversions. The value of Vconv is constrained as 
a Gaussian probability in the likelihood. 



^i/^conv Rconv 1 ARconv^ 



1 



v/2Ai?c 



I ( Tconv-Rconv 



The fit parameters that set the scale; for the num- 
ber of residual conversions events with the J/ip candi- 
date from bottom decay and direct production are Tlbconv 
and ndconv The parameters represent the number of 
found conversions with the J / ip candidate from the given 
source. The number of residual conversions fit from 



these two sources are r„ 



'conv ' '^bconv and Vconv ' ^dconv 

The number of residual conversions already included in 



5. Fake J/tp backgrounds 

The fake J/ ip impact parameter ct background shape 
i-^sideband) determined in section IV C 6 from a fit to 
the data for the muon sample. The predicted number 
of events for this background is the ratio between the 
number of fake J/tp events in the J/tp mass signal and 
sideband region (Rside) times the number of events seen 
in data with the J/ip candidate in the mass sideband 
regions {Ngide) for the given sample and A(p region. 

In section IV B, the ratio is determined to be Rside = 
0.501 ±0.044 from a fit of the total J/tp data sample. The 
same value for the fit parameter rside is used in both A(p 
regions in the sample, but can be different in the electron 
and muon samples. The fit value of rside is constrained 
using a Gaussian factor in the likelihood function. 



(C2) Girside — Rside, ARgide) 



1 



-if- 



'side ^side 



y 



V^ARside 

(C5) 

The corresponding fit parameter Hside, for the given 
sample and Acp region, is constrained using the Poisson 
probability of measuring Nside events for a sample with 
an average of ngide events. 



Pij^sidei N side^ — 



\Ns 



Nszde\ 



(C6) 
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The contribution of the shape component of the likc- 
Uhood is . Fside(x,y) for the given sample and 

''^signal 

A(f> region. 

The fake J/tp background component in the electron 
sample is treated differently due to the presence of resid- 
ual conversions in the background. The fake J/tp shape 
is fit at the same time as the J/iJj signal region. The 
fconv component of the fake J/tp impact parameter-cr 
shape is a composite of two variables which are con- 



where r„ 



conv ) '''convside ? 



strained, fconv — 
and Nsideband are the fit ratio of residual to found con- 
version, fit number of found conversions, and N sideband = 
^side+^side- parameter Uconvside is constrained by 
the number of conversions found in the sideband using 
the Poisson probability: 



P{n. 



convside") -^convside 



convside ) 



convside • 



l^convside 

(C7) 



where ANb^ is the positive-sided error of Nb^ if {tiBc ~ 
^bJ > 0.0, and the negative-sided error otherwise. 



7. b —^ J/ipifakeX backgrounds 

As in the Be background, b — > J/tp^fakeX background 
events are only expected to populate the toward region 
in Acf), and therefore the background is only fit for in the 
toward region in the two samples. The expected number 
is 6 ^ J/tp£fakeX events is used as a constraint to the 
likelihood in the same way as Be- 



G{nBf^,,-NB,^,^,ANB,^,J 



1 



X e 



6. Be — > J/tp£X backgrounds 

Be — > J/tpiX background is predicted to only popu- 
late the toward region in A(t>. The c^xpectcd number of 
Be events is constrained in the likelihood as a Gaussian 
probability factor. 

1 1 / "-bc-"bc y 

G{nB^-NB^,ANBj = ^^—-e '^"^^ J (C8) 



8. rieignai sums 

The number of events fit in the J mass signal region 
is a function of fit parameters described previously in 
this section. Listed below are the functions for number 
of events fit for the two samples and Acp regions. 
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9. Impact pEirameter— cT shape component gions. As a reminder, x is the additional lepton candi- 

date's impact parameter and y is the J/tp candidate's 

The complete functions for the shape components of 
the fit are listed below for the two samples and A<p re- 



(CIS) 



(C14) 



36 



1 



n' 



pe,t 



shape 



Fdirectiy)Fdirecti^) ''"side''^sideFsidei^ ' V) 

1 



n 



pie, a 
shape 



.e, nl:^Fr{y)Fnx) + nliFt^'\y)F2t;Ux) + n^£F^,];Uy)F,^°'%x) 

sigjial 

+nfaKl;Uy)Fd^;Ux) + n^^^^^^ 

'^''^side'^^sideFsidei^yy) ''^convn'bconvFconv{x)F^^ ' {y) + Tconv1^dconvFconv{x)F^^^^^^{y) 
1 



n^F^^^\y)Ft°^\x) + nl';F^^'%y)F^^;:jx) + n^',"i^,T4,(y)C'^(a: 



ido,ef 



signal 
e.a T-icr,e 



^dd Fdirectiy)Fdirect^x) + '''side^sldeFsidei^' 2/) + 1'convn'liconvFconv{x)F^ ' (?/) 



+f'convndconvFconv {x)Fdirect 



(C15) 

(C16) 

(C17) 
(C18) 
(C19) 

(C20) 
(C21) 
(C22) 



10. Bin constraints component 

The constraints which arc specific to a given region in 
A(/) and sample are Usted below: 
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11. Global constraints component 



hood function for the muon sample is: 



The global constraints are the simplest component of 
the likelihood. The functions of the global constraints 
are listed below: 
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12. Log-likelihood function 



Finally, the log-likclihood can be assembled from the 
functions developed in the previous sections. The likeli- 
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global XX 



bin 
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where x^j and yf ^ are the impact parameter of the addi- 
tional muon c;andidatc and the ct of the J/ijj candidate 
for J*'* event in the i"* A(p region in the J/t/j signal region. 

The likelihood function for the electron sample is sim- 
ilar to the muon likelihood. The electron likelihood in- 
cludes conversion terms and the fit of the J/ijj mass side- 
band region. 
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bin ri (y-^shapei^ij ^ ytj^^ 



^sideband 
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k 

where x^j and yf j are the impact parameter of the addi- 
tional electron candidate and the cr of the J/ tp candidate 
for j*'* event in the z*'* Ac^ region in the J/ tp signal region, 
and x| and are the impact parameter of the additional 
electron candidate and the cr of the J/ip candidate for 
fc*'' event in the J/ip sideband region. 



APPENDIX D: TESTS OF J/V^-LEPTON 
CORRELATION LOG-LIKELIHOOD FIT 

The impact parameter-cr likelihood fit is tested using 
a large set of toy Monte Carlo samples. First, the input 
means for the various fit components are chosen to be 
similar to the results in data. The constrained terms are 
chosen to be consistent with the constraint. These inputs 
arc Poisson fluctuated to determine the composition of 
each sample. Each event is assigned an impact parameter 
and cr according to the shape function used to describe 
that type of event . 

Next, the fit constraints not yet varied {Rside, Rconv, 
^Bj^k.^ ^B^, Nlonv^ ^conv^ and Nconvside) are fiuctu- 
ated using the appropriate statistic. The fluctuated con- 
straints are then used in the fit of the toy Monte Carlo 
sample. 



A total of 1000 samples arc generated and fit for both 
the electron and muon samples. The fit values are not 
forced to be non-negative. The pull is calculated for each 
fit value relative to the non-fluctuated input quantities. 
The pull is equal to width of the distribution where 
n is the fit value, (7„ is the fit error returned, and /i is the; 
average value of the parameter input. The bias, which 
is the mean of the distribution, is also measured. 
Finally, the average difference bet ween th e fitted value 
and input parameter is calculated, (x — x). 

The pulls, biases, and average differences for all vari- 
ables are acceptable for both test samples. All pulls are 
within ±6% of 1 and all biases arc within ±0.12(t of 0. Al- 
lowing the likelihood to have negative components yields 
fit results with meaningful fit values and errors. 

Figure 27 shows the minimum log-likelihood distri- 
butions of both samples. We find that 19.6% of the 
muon toy Monte Carlo samples and 49.8% of the electron 
toy Monte Carlo samples have a higher minimum log- 
likelihood than the data. The minimum log-likelihood 
distributions along with the biases, pulls, and average 
differences give confidence that the likelihood is working 
properly. 

The muon toy Monte Carlo samples have an input 
mean of flZZds = 34-5% and a fit mean of = 
34.5 ± 0.4%. The width of the fit /toward distribution is 
10.9 ± 0.3%, which is consistent with the error seen in 
data of t8;2%- The electron toy Monte Carlo samples 
have an input mean of = 19.2% and a fit mean 



towa7'ds 

18.6 ± 0.2%. The width of the fit ftoward 
distribution is 6.0±0.1, which is consistent with the error 
seen in data of 1^ 0%. 
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FIG. 27: The minimum log-likelihood distributions of the toy Monte Carlo Left: Electrons Right: Muons. 
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